(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
14 February 2002 (14.02.2002) 







PCT 



(10) International Publication Number 

WO 02/12294 A2 



(51) International Patent Classification 7 : C07K 14/315, 

A61K 39/09, C07K 16/12, C12N 5/12, A61K 39/40, 
C12N 15/12, 15/63, A61K 48/00, C12Q 1/68, G01N 
33/53, C07K 14/34 

(21) International Application Number: PCT7US0 1/24795 

(22) International Filing Date: 8 August 2001 (08.08.2001) 



(25) Filing Language: 



(26) Publication Language: 



English 
English 



(30) Priority Data: 

09/634,341 



8 August 2000 (08.08.2000) US 



(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier application: 

US 09/634,341 (CON) 

Filed on 8 August 2000 (08.08.2000) 

(71) Applicants (for all designated States except US): ST. 
JUDE CHILDREN'S RESEARCH HOSPITAL 
[US/US]; 332 North Lauderdale Street, Memphis, TN 
38105 2794 (US). UNIVERSITY OF UTAH RE- 
SEARCH FOUNDATION [US/US]; 615 Arapeen Drive, 
Suite 10, Salt Lake City, UT 84108 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): ADDERSON, 



Elisabeth [CA/US]; 1041 Murray Hill Lane S., Memphis, 
TN 38120 (US). BOHNSACK, John [US/US]; 760 South 
1200 East, Salt Lake City, UT 84102 (US). 

(74) Agent: DIETZEL, Christine, E.; Klauber & Jackson, 411 
Hackensack Avenue, Hackensack, NJ 07601 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 

AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EE, ES, FT, GB, GD, GE, GH, GM, 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, TD, 
TG). 

Published: 

without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



_j (54) Title: GROUP B STREPTOCOCCUS POLYPEPTIDES NUCLEIC ACIDS AND THERAPEUTIC COMPOSITIONS AND 
VACCINES THEREOF 



< 



"^f (57) Abstract: This invention provides isolated nucleic acids encoding polypeptides comprising amino acid sequences of strepto- 
©N coccal matrix adhesion (Ema) polypeptides. The invention provides nucleic acids encoding Group B streptococcal Ema polypeptides 
EmaA, EmaB, EmaC, EmaD and EmaE. The present invention provides isolated polypeptides comprising amino acid sequences of 
Group B streptococcal polypeptides EmaA, EmaB, EmaC, EmaD and EmaE, including analogs, variants, mutants, derivatives and 
fragments thereof. Ema homologous polypeptides from additional bacterial species, including S. pneumoniae, S. pyogenes, E. fae- 
calis and C diptheriae are also provided. Antibodies to the Ema polypeptides and immunogenic fragments thereof are also provided. 
The present invention relates to the identification and prevention of infections by virulent forms of streptococci. This invention pro- 
vides pharmaceutical compositions, immunogenic compositions, vaccines, and diagnostic and therapeutic methods of use of the 
isolated polypeptides, antibodies thereto, and nucleic acids. Assays for compounds which modulate the polypeptides of the present 
invention for use in therapy are also provided. 



o 



WO 02/12294 



PCT/US01/24795 



GROUP B STREPTOCOCCUS POLYPEPTIDES NUCLEIC ACIDS AND 
THERAPEUTIC COMPOSITIONS AND VACCINES THEREOF 



GOVERNMENTAL SUPPORT 



The research leading to the present invention was supported, at least in part, by a grant 
from NATD J Grant No.A140918. Accordingly, the Government may have certain 
10 rights in the invention. 

FIELD OF THE INVENTION 



This invention relates generally to extracellular matrix adhesin (Ema) proteins, 
15 antibodies thereto and to vaccines, compositions and therapeutics. The Group B 
streptococcal Ema polypeptides are EmaA, EmaB, EmaC, EmaD and EmaE. The 
invention further relates to Ema polypeptides from various species of bacteria, 
including S. pneumoniae, S. pyogenes, E. faecalis and C. diptheriae. The invention 
also relates to the identification and prevention of infections by streptococci. Isolated 
20 nucleic acids encoding Group B streptococcal Ema polypeptides, particularly EmaA, 
EmaB, EmaC, EmaD and EmaE and to other bacterial Ema homologs are included 
herein. Assays for compounds which modulate the polypeptides of the present 
invention for use in therapy are also provided. 

25 BACKGROUND OF THE INVENTION 



Streptococci are catalase negative gram positive cocci. They may be classified by the 
type of hemolysis exhibited on blood agar, by the serologic detection of carbohydrate 
antigens, or by certain biochemical reactions. Medically important streptococci include 
30 Groups A, B, D, S. pneumoniae and the viridans group of streptococci. Lancefield 

type A (GroupA) Streptococcus pyogenes is an important human pathogen - the cause 
of streptococcal pharyngitis, impetigo and more severe infections such as bacteremia 
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and necrotizing fascitis. The immunologic sequelae of Group A Streptococcal 
infections are also important health problems - rheumatic carditis is the most common 
cause of acquired cardiac disease worldwide and post-streptococcal glomerulonephritis 
is a cause of hypertension and renal dysfunction. Group B Streptococcus agalactiae are 
5 the most common cause of serious bacterial infections in newborns, and important 
pathogens in pregnant women and nonpregnant adults with underlying medical 
problems such as diabetes and cardiovascular disease. Group D streptococci include 
the enterococci (Streptococcus faecalis and faecium) and the "nonenterococcal" Group 
D streptococci. Streptococcus pneumoniae (pneumococcus) is not classified by group 

10 in the Lancefield system. Pneumocopci are extremely important human pathogens, the 
most common cause of bacterial pneumonia, middle ear infections and meningitis 
beyond the newborn period. The viridans group of streptococci include S. milleri, S. 
mitis, S. sanguis and others. They cause bacteremia, endocarditis, and dental 
infections. Enterococci are important causes of urinary tract infections, bacteremia 

15 and wound infections (predominantly as nosocomial infections in hospitalized patients), 
and endocarditis. Over the past decade enterococci have developed resistance to many 
conventional antibiotics and there are some strains resistant to all known antibiotics. 

Group B streptococci (GBS) are the most common cause of serious bacterial disease 
20 in neonates, and are important pathogens in pregnant women and adults with 
underlying illnesses (Baker CJ. (2000) "Group B streptococcal infections" in 
Streptococcal infections. Clinical aspects, microbiology, and molecular 
pathogenesis. (D. L. Stevens and E. L. Kaplan), New York: Oxford University Press, 
222-237). Common manifestations of these infections include bacteremia, pneumonia, 
25 meningitis, endocarditis, and osteoarticular infections (Baker CJ. (2000) "Group B 
streptococcal infections" in Streptococcal infections. Clinical aspects, microbiology, 
and molecular pathogenesis. (D. L. Stevens and E. L. Kaplan), New York: Oxford 
University Press, 222-237; Blumberg H.M. et al. (1996) J Infect Dis 173:365-373). 
The incidence of invasive GBS disease is approximately 2.6 in 1000 live births and 7.7 
30 in 100,000 in the overall population, with mortality rates that vary from 6 to 30% 
(Baker CJ. (2000) "Group B streptococcal infections" in Streptococcal infections. 
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Clinical aspects, microbiology, and molecular pathogenesis. (D. L. Stevens and E. L. 
Kaplan), New York; Oxford University Press, 222-237; Blumberg H.M. et al. (1996) J 
Infect Dis 173:365-373). Although much neonatal disease is preventable by 
administration of prophylactic antibiotics to women in labor, antibiotic prophylaxis 
5 programs can be inefficient, suffer from poor compliance, or fail if antibiotic resistance 
emerges. No effective prophylaxis strategy for adult infections has been established. 

During childbirth, GBS can pass from the mother to the newborn. By one estimate, up 
' to 30% of pregnant women carry GBS at least temporarily in the vagina or rectum 

10 without symptoms. Infants born to these women become colonized with GBS during 
delivery (Baker, C.J. and Edwards, M.S. (1995) "Group B Streptococcal Infections" in 
Infectious Disease of the Fetus and Newborn Infant (J.S. Remington and J.O Klein), 
980-1054). Aspiration of infected amniotic fluid or vaginal secretions allow GBS to 
gain access to the lungs. Adhesion to, and invasion of, respiratory epithelium and 

15 endothelium appear to be critical factors in early onset neonatal infection. (Baker, C.J. 
and Edwards, M.S. (1995) "Group B Streptococcal Infections" in Infectious Disease 
of the Fetus and Newborn Infant (J.S. Remington and J.O Klein), 980-1054; Rubens, 
C.E. et al. (1991) J InfDis 164:320-330). Subsequent steps in infection, such as blood 
stream invasion and the establishment of metastatic local infections have not been 

20 clarified. The pathogenesis of neonatal infection occurring after the first week of life is 
also not well understood. Gastrointestinal colonization may be more important than a 
respiratory focus in late onset neonatal disease (Baker, C.J. and Edwards, M.S. (1995) 
"Group B Streptococcal Infections" in Infectious Disease of the Fetus and Newborn 
Infant (J.S. Remington and J.O Klein), 980-1054). Considerable evidence suggests that 

25 invasion of brain microvascular endothelial cells by GBS is the initial step in the 
pathogenesis of meningitis. GBS are able to invade human brain microvascular 
endothelial cells and type III GBS, which are responsible for the majority of meningitis, 
accomplish this 2-6 times more efficiently than other serotypes (Nizet, V. et al. (1997) 
Infect Immun 65:5074-5081), 
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Because GBS is widely distributed among the population and is an important pathogen 
in newborns, pregnant women are commonly tested for GBS at 35-37 weeks of 
pregnancy. Much of GBS neonatal disease is preventable by administration of 
prophylactic antibiotics during labor to women who test positive or display known risk 
5 factors. However, these antibiotics programs do not prevent all GBS disease. The 
programs are deficient for a number of reasons. First, the programs can be inefficient. 
Second, it is difficult to ensure that all healthcare providers and patients comply with 
the testing and treatment. And finally, if new serotypes or antibiotic resistance 
emerges, the antibiotic programs may fail altogether. Currently available tests for GBS 
10 are inefficient. These tests may provide false negatives. Furthermore, the tests are not 
specific to virulent strains of GBS. Thus, antibiotic treatment may be given 
unnecessarily and add to the problem of antibiotic resistance. Although a vaccine 
would be advantageous, none are yet commercially available. 

15 Traditionally, GBS are divided into 9 serotypes according to the immunologic 

reactivity of the polysaccharide capsule (Baker CJ. (2000) "Group B streptococcal 
infections" in Streptococcal infections. Clinical aspects, microbiology, and molecular 
pathogenesis. (D. L. Stevens and E. L. Kaplan), New York: Oxford University Press, 
222-237; Blumberg H.M. et al. (1996) J Infect Dis 173:365-373; Kogan, G. et al 

20 (1996) J Biol Chem 271:8786-8790). Serotype III GBS are particularly important in 
human neonates, causing 60-70% of all infections and almost all meningitis (Baker CJ. 
(2000) "Group B streptococcal infections" in Streptococcal infections. Clinical 
aspects, microbiology, and molecular pathogenesis. (D, L. Stevens and E. L. Kaplan), 
New York: Oxford University Press, 222-237). Type III GBS can be subdivided into 

25 three groups of related strains based on the analysis of restriction digest patterns 

(RDPs) produced by digestion of chromosomal DNA with Hind III and &re8387 (I. Y. 
Nagano et al. (1991) J Med Micro 35:297-303; S. Takahashi et al. (1998) JInfDis 
177:1116-1119). 

30 Over 90% of invasive type III GBS neonatal disease in Tokyo, Japan and in Salt Lake 
City, Utah is caused by bacteria from one of three RDP types, termed RDP type III-3, 
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while RDP type III-2 are significantly more likely to be isolated from vagina than from 
blood or CSF. These results suggest that this genetically-related cluster of type III-3 
GBS are more virulent than III-2 strains and could be responsible for the majority of 
invasive type III disease globally. 

5 

Preliminary vaccines for GBS used unconjuated purified polysaccaride. GBS poly - 
and oligosaccharides are poorly immunogenic and fail to elicit significant memory and 
booster responses. Baker et al immunized 40 pregnant women with purified serotype 
III capsular polysaccharide (Baker, C.J. et al. (1998) New Engl J of Med 

10 319:11 80- 1185). Overall, only 57% of women with low levels of specific antibody 

responded to the vaccine. The poor immunogenicity of purified polysaccharide antigen 
was further demonstrated in a study in which thirty adult volunteers were immunized 
with a tetravalent vaccine composed of purified polysaccharide from serotypes la, lb, 
II, and III (Kotloff, K.L. et al. (1996) Vaccine 14:446-450). Although safe, this 

15 vaccine was only modestly immunogenic, with only 13% of subjects responding to 
type lb, 17% to type II, 33% responding to type la, and 70% responding to type III 
polysaccharide. The poor immunogenicity of polysaccharide antigens prompted efforts 
to develop polysaccharide conjugate vaccines, whereby these poly - or 
oligosaccharides are conjugated to protein carriers. Ninety percent of healthy adult 

20 women immunized with a type III polysaccharide-tetanus toxoid conjugate vaccine 
responded with a 4-fold rise in antibody concentration, compared to 50% immunized 
with plain polysaccharide (Kasper, D.L. et al (1996) J of Clin Invest 98:2308-23 14). 
A type la/lb polysaccharide-tetanus toxoid conjugate vaccine was similarly more 
immunogenic in healthy adults than plain polysaccharide (Baker, C.J. et al (1999) J 

25 Infect Dis 179:142-150). 

The disadvantage of polysaccharide-protein conjugate vaccines is that the process of 
purifying and conjugating polysaccharides is difficult, time-consuming and expensive. 
A protein antigen which could be cheaply and easily produced would be an 
30 improvement. 
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If one were to make a polysaccharide-protein conjugate vaccine, a GBS-specific 
carrier protein may be preferable to one of the commonly used carriers such as tetanus 
or diphtheria toxoids because of the potential problems associated with some of these 
carrier proteins, particularly variable immunogenicity and the problems associated with 
5 repeated vaccination with the same carrier protein. Selection of appropriate carrier 
proteins is important for the development of polysaccharide-protein vaccine 
formulations. For example, Haemophilus influenzae type b poly- or oligosaccharide 

i 

conjugated to different protein carriers has variable immunogenicity and elicits 
antibody with varying avidity (Decker, M.D. et al (1992) J Pediatrics 120: 184-189; 

10 Schlesinger, Y. (1992) JAMA 267:1489-1494). Repeated immunization with the same 
carrier protein may also suppress immune responses by competition for specific B cells 
(epitopic suppression) or other mechanisms. This is of particular concern for the 
development of GBS vaccines since recently developed poly/oligosaccharide-protein 
conjugate vaccines against the bacteria H. influenzae, S. pneumoniae, and N. 

15 meningitidis all utilize a restricted number of carrier proteins (tetanus toxoid, 

CRM197, diptheria toxoid), increasing the number of exposures to these carriers an 
individual is likely to receive. Additionally, using tetanus as a carrier protein offers no 
specific advantage beyond the improved immunogenicity of the vaccine. A 
second-generation vaccine containing a GBS-specific carrier protein would enhance 

20 immunogenicity and have an advantage in that a GBS-specific immune response would 
be generated against both the carrier protein and the poly/oligosaccharide. 

Therefore, in view of the aforementioned deficiencies attendant with prior art vaccines 
and methods, it should be apparent that there still exists a need in the art for an 

25 effective and immunogenic GBS vaccine. The availability and use of a GBS 

polypeptide in a conjugate vaccine is desirable. A GBS polypeptide which is present 
or expressed in all GBS serotypes would have the added advantage of providing broad, 
general immunity across many GBS serotypes. It would be particularly relevant and 
useful to provide a streptococcal vaccine or immunogen which is expressed broadly in 

30 various streptococcal species, whereby broad or general immunity against multiple and 
unique groups of streptococci (for instance, Group A, Group B and S, pneumoniae), 
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particularly against distinct virulent and clinically relevant streptococcal bacteria, could 
thereby be generated. 

The citation of references herein shall not be construed as an admission that such is 
5 prior art to the present invention. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, streptococcal polypeptides termed 
10 extracellular matrix adhesins (Ema) pre provided which are particularly useful in the 
identification and prevention of infections by streptococci. 

In its broadest aspect, the present invention encompasses isolated polypeptides 
comprising an amino acid sequence of a streptococcal polypeptide selected from the . 
15 group of EmaA, EmaB, EmaC, EmaD and EmaE. The isolated peptides, including 
combinations of one or more thereof, are suitable for use in immunizing animals and 
humans against bacterial infection, particularly streptococci. 

The present invention is directed to an isolated streptococcal EmaA polypeptide which 
20 comprises the amino acid sequence set out in SEQ ID NO: 2, and analogs, variants and 
immunogenic fragments thereof. 

The present invention is directed to an isolated streptococcal EmaB polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 4, and analogs, variants and 
25 immunogenic fragments thereof. . 

The present invention is directed to an isolated streptococcal EmaC polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 6, and analogs, variants and 
immunogenic fragments thereof. 
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The present invention is directed to an isolated streptococcal EmaD polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 8, and analogs, variants and 
immunogenic fragments thereof 

5 The present invention is directed to an isolated streptococcal EmaE polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 10, and analogs, variants 
and immunogenic fragments thereof. 

The present invention also provides Ema polypeptide homologs from distinct bacterial 
10 species, particularly including distinct streptococcal species, more particularly 

including Group B streptococcus, Group A streptococcus (particularly S. pyogenes) 
and S. pneumoniae. The present invention also provides Ema polypeptides from 
additional distinct bacterial species, particularly including Enterococcus faecalis and 
Cotynebacterium diptheriae. Nucleic acids encoding Ema polypeptide homologs from 
15 distinct bacterial species are also provided. 

The present invention thus provides an isolated streptococcal Ema polypeptide 
comprising the amino acid sequence set out in SEQ ID NO:23. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:23 is further 
20 provided. 

The invention thus further provides an isolated streptococcal Ema polypeptide 
comprising the amino acid sequence set out in SEQ ID NO:26. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:26 is further 
25 provided. 

The present invention further provides an isolated streptococcal Ema polypeptide 
comprising the amino acid sequence set out in SEQ ED NO: 3 7. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:37 is further 
30 provided. 
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An enterococcal Ema polypeptide is further provided comprising the amino acid 
sequence set out in SEQ ID NO:29. An isolated isolated nucleic acid which encodes 
the enterococcal polypeptide set out in SEQ ED NO:29 is also provided. 

5 The invention provides an isolated Corynebacterium Ema polypeptide comprising the 
amino acid sequence set out in SEQ ID NO: 32. Also provided is an isolated nucleic 
acid which encodes the Corynebacterium polypeptide set out in SEQ ID NO: 32. 

The invention provides an isolated bacterial polypeptide comprising the amino acid 
10 sequence TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34), wherein the 
polypeptide is not isolated from Actinomyces. 

The invention further provides an isolated streptococcal polypeptide comprising the 
amino acid sequence TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34). 

15 

Also provided is an isolated bacterial polypeptide comprising the amino acid sequence 
TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35). 

The present invention includes an isolated bacterial polypeptide comprising the amino 
20 acid sequence TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36). An isolated 
streptococcal polypeptide comprising the amino acid sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36) is also provided. 

The invention further includes an isolated polypeptide having the amino acid sequence 
25 selected from the group of TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34), 

TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35), and TLVTCTPYGVNTKRLLVRG 
(SEQ ID NO: 36). 

The present invention contemplates the use of the polypeptides of the present invention 
30 in diagnostic tests and methods for determining and/or monitoring of streptococcal 
infection. Thus, the present invention provides an isolated Ema polypeptide, 
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particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, 
labeled with a detectable label. 

In the instance where a radioactive label, such as the isotopes 3 H, 14 C, 32 P, 35 S, 36 C1, 
5 51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, l25 I, l31 I, and 186 Re are used, known currently available 
counting procedures may be utilized. In the instance where the label is an enzyme, 
detection may be accomplished by any of the presently utilized colorimetric, 
spectrophotometric, fluorospectrophotometric, amperometric or gasometric techniques 
known in the art. 

10 

The present invention extends to an immunogenic Ema polypeptide, particularly 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, or a fragment 
thereof. The present invention also extends to immunogenic Ema polypeptides 
wherein such polypeptides comprise a combination of at least one immunogenic Ema 
15 polypeptide, selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, or 
immunogenic polypeptide fragment thereof, and a GBS polypeptide selected from the 
group of Spbl, Spb2, C protein alpha antigen, Rib, Lmb, C5a-ase, or immunogenic 
fragments thereof. 

r 

20 In a further aspect, the present invention extends to vaccines based on the Ema 

proteins described herein. The present invention provides a vaccine comprising one or 
more streptococcal polypeptides selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable adjuvant. The present invention 
provides a vaccine comprising one or more streptococcal polypeptides selected from 

25 the group of the polypeptide of SEQ ID NO: 23, 26, and 37, and a pharmaceutically 
acceptable adjuvant. 

The present invention further provides a streptococcal vaccine comprising one or more 
Group B streptococcal polypeptides selected from the group of EmaA, EmaB, EmaC, 
30 EmaD and EmaE, further comprising one or more additional streptococcal antigens. 
The present invention further provides a GBS vaccine comprising one or more Group 
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B streptococcal polypeptides selected from the group of EmaA, EmaB, EmaC, EmaD 
and EmaE, further comprising one or more additional GBS antigens. In a particular 
embodiment, the GBS antigen is selected from the group of the polypeptide Spbl or an 
immunogenic fragment thereof, the polypeptide Spb2 or an immunogenic fragment 
5 thereof, C protein alpha antigen or an immunogenic fragment thereof, Rib or an 

immunogenic fragment thereof Lmb or an immunogenic fragment thereof, C5a-ase or 
an immunogenic fragment thereof and Group B streptococcal polysaccharides or 
oligosaccharides. 

10 In another aspect, the invention is directed to a vaccine for protection of an animal 

subject from infection with streptococci comprising an immunogenic amount of one or 
more Ema polypeptide EmaA, EmaB, EmaC, EmaD or EmaE, or a derivative or 
fragment thereof. Such a vaccine may contain the protein conjugated covalently to a 
GBS bacterial polysaccharide or oligosaccharide or polysaccharide or oligosaccharide 

15 from one or more GBS serotypes. 

In a still further aspect, the present invention provides an immunogenic composition 
comprising one of more streptococcal polypeptides selected from the group of EmaA, 
EmaB, EmaC, EmaD and EmaE, and a pharmaceutically acceptable adjuvant. 

20 

The present invention further provides an immunogenic composition comprising one or 
more Group B streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE, further comprising one or more antigens selected from the 
group of the polypeptide Spbl or an immunogenic fragment thereof, the polypeptide 
25 Spb2 or an immunogenic fragment thereof, C protein alpha antigen or an immunogenic 
fragment thereof, Rib or an immunogenic fragment thereof Lmb or an immunogenic 
fragment thereof, C5a-ase or an immunogenic fragment thereof, and Group B 
streptococcal polysaccharides or oligosaccharides. 

30 The invention further provides pharmaceutical compositions, vaccines, and diagnostic 
and therapeutic methods of use thereof. 
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The invention provides pharmaceutical compositions comprising a bacterial Ema 
polypeptide and a pharmaceutically acceptable carrier. The invention provides 
pharmaceutical compositions comprising a streptococcal polypeptide selected from the 

4 

group of EmaA, EmaB, EmaC, EmaD and EmaE, the polypeptide of SEQ ED NO: 23, 
5 the polypeptide of SEQ ID NO: 26, the polypeptide of SEQ ID NO:37, and a 
pharmaceutically acceptable carrier. The invention provides pharmaceutical 
compositions comprising a streptococcal polypeptide selected from the group of 
EmaA, EmaB, EmaC, EmaD and EmaE, and a pharmaceutically acceptable carrier. 
The present invention further provides pharmaceutical compositions comprising one or 
10 more GBS Ema polypeptide, or a fragment thereof, in combination with one or more 
of GBS polypeptide Spbl, Spb2, C protein alpha antigen, Rib, Lmb, C5a-ase, a Group 
B streptococcal polysaccharide or oligosaccharide vaccine, and an anti-streptococcal 
vaccine. 

15 In a still further aspect, the present invention provides a purified antibody to a 

streptococcal polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE. In a still further aspect, the present invention provides a purified antibody to a 
streptococcal polypeptide selected from the group of the polypeptide of SEQ ID 
NO:23, the polypeptide of SEQ ID NO: 26, and the polypeptide of SEQ ID NO:37. 

20 

Antibodies against the isolated polypeptides of the present invention include naturally 
raised and recombinantly prepared antibodies. These may include both polyclonal and 
monoclonal antibodies prepared by known genetic techniques, as well as bi-specific 
(chimeric) antibodies, and antibodies including other functionalities suiting them for 

25 diagnostic use. Such antibodies can be used in immunoassays to diagnose infection 
with a particular strain or species of bacteria. The antibodies can also be used for 
passive immunization to treat an infection with streptococcal bacteria including Group 
B streptococcus, Group A streptococcus, and S. pneumoniae. These antibodies may 
also be suitable for modulating bacterial adherence and/or invasion including but not 

30 limited to acting as competitive agents. 



4 
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The present invention provides a monoclonal antibody to a streptococcal polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. The invention 
thereby extends to an immortal cell line that produces a monoclonal antibody to a 
streptococcal poypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and 
5 EmaE. 

An antibody to a streptococcal Ema polypeptide EmaA, EmaB, EmaC,. EmaD or 
EmaE labeled with a detectable label is further provided. In particular embodiments, 
the label may selected from the group consisting of an enzyme, a chemical which 
10 fluoresces, and a radioactive element. 

The present invention provides a pharmaceutical composition comprising one or more 
antibodies to a streptococcal protein selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable carrier. The invention further 

15 provides a pharmaceutical composition comprising a combination of at least two 
antibodies to Group B streptococcal proteins and a pharmaceutically acceptable 
carrier, wherein at least one antibody to a protein selected from the group of EmaA, 
EmaB, EmaC, EmaD, and EmaE is combined with at least one antibody to a protein 
selected from the group of Spbl, Spb2, Rib, Lmb, C5a-ase and a C protein alpha 

20 antigen. . 

The present invention also relates to isolated nucleic acids, such as recombinant DNA 
molecules or cloned genes, or degenerate variants thereof, mutants, analogs, or 
fragments thereof, which encode the isolated polypeptide of the present invention or 

25 which competitively inhibit the activity of the polypeptide. The present invention 

further relates to isolated nucleic acids, such as recombinant DNA molecules or cloned 
genes, or degenerate variants thereof, mutants, analogs, or fragments thereof, which 
encode a bacterial Ema polypeptide. The present invention further relates to isolated 
nucleic acids, such as recombinant DNA molecules or cloned genes, or degenerate 

30 variants thereof, mutants, analogs, or fragments thereof, which encode a streptococcal 
Ema polypeptide. The present invention further relates to isolated nucleic acids, such 
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as recombinant DNA molecules or cloned genes, or degenerate variants thereof, 
mutants, analogs, or fragments thereof, which encode a streptococcal Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE. Preferably, the isolated nucleic acid, which includes degenerates, variants, 
5 mutants, analogs, or fragments thereof, has a sequence as set forth in SEQ ID NOS: 1, 
3, 5, 7 or 9. In a further embodiment of the invention, the DNA sequence of the 
recombinant DNA molecule or cloned gene may be operatively linked to an expression 
control sequence which may be introduced into an appropriate host. The invention 
accordingly extends to unicellular hosts transformed with the cloned gene or 
10 recombinant DNA molecule comprising a DNA sequence encoding an Ema protein, 
particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and 
more particularly, the DNA sequences or fragments thereof determined from the 
sequences set forth above. 

15 In a particular embodiment, the nucleic acid encoding the EmaA polypeptide has the 
sequence selected from the group comprising SEQ ID NO:l; a sequence that 
hybridizes to SEQ ID NO: 1 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 1 or 
a sequence that hybridizes to SEQ ID NO: 1 under moderate stringency hybridization 

20 conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaA polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 1; a sequence 
complementary to SEQ ID NO: 1 ; or a homologous sequence which is substantially 
similar to SEQ ID NO: 1. In a further embodiment, the nucleic acid has the sequence 

25 consisting of SEQ ID NO : 1 . 

In a particular embodiment, the nucleic acid encoding the EmaB polypeptide has the 
sequence selected from the group comprising SEQ ID NO:3; a sequence that 
hybridizes to SEQ ID NO:3 under moderate stringency hybridization conditions; DNA 
30 sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 3 or 
a sequence that hybridizes to SEQ ID NO: 3 under moderate stringency hybridization 
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conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof. In a particular embodiment, the nucleic acid encoding the EmaB polypeptide 
has the sequence selected from the group comprising SEQ ID NO:3; a sequence 
complementary to SEQ ID NO:3; or a homologous sequence which is substantially 
5 similar to SEQ ID NO:3. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:3. 

In a particular embodiment, the nucleic acid encoding the EmaC polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 5; a sequence that 

10 hybridizes to SEQ ID NO: 5 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 5 or 
a sequence that hybridizes to SEQ ID NO: 5 under moderate stringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaC polypeptide 

15 has the sequence selected from the group comprising SEQ ID NO: 5; a sequence 
complementary to SEQ ID NO: 5; or a homologous sequence which is substantially 
similar to SEQ ID NO:5. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:5. 

, 20 In a particular embodiment, the nucleic acid encoding the EmaD polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 7; a sequence that 
hybridizes to SEQ ID NO: 7 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 7 or 
a sequence that hybridizes to SEQ ID NO: 7 under moderate stringency hybridization 

25 conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaD polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 7; a sequence 
complementary to SEQ ED NO: 7; or a homologous sequence which is substantially 
similar to SEQ ID NO:7. In a further embodiment, the nucleic acid has the sequence 

30 consisting of SEQ ID NO:7. 
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In a particular embodiment, the nucleic acid encoding the EmaE polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 9; a sequence that 
hybridizes to SEQ ID NO: 9 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO. 9 or 
5 a sequence that hybridizes to SEQ ID NO: 9 under moderate stringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaE polypeptide 
has the sequence selected from the group comprising SEQ ID NO. 9; a sequence 
complementary to SEQ ID NO:9; or a homologous sequence which is substantially 
10 similar to SEQ ID NO: 9 In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:9. 

In a further embodiment, the nucleic acid encoding the bacterial Ema polypeptide 
comprises the sequence selected from the group comprising SEQ ID NO: 24, 27, 30 
15 and 33. In a further embodiment, the nucleic acid encoding the bacterial Ema 

polypeptide has the sequence selected from the group comprising SEQ ID NO : 24, 27, 
30 and 33. 

A nucleic acid capable of encoding a streptococcal polypeptide EmaA, EmaB, EmaC, 
20 EmaD or EmaE which is a recombinant DNA molecule is further provided. Such a 
recombinant DNA molecule wherein the DNA molecule is operatively linked to an 
expression control sequence is also provided herein. 

- The present invention relates to nucleic acid vaccines or DNA vaccines comprising 
25 nucleic acids encoding immunogenic streptococcal Ema polypeptides, particularly 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. The present 
invention relates to nucleic acid vaccines or DNA vaccines comprising nucleic acids 
encoding one or more immunogenic Ema polypeptide or a fragment thereof or any 
combination of one or more Ema polypeptide EmaA, EmaB, EmaC, EmaD or EmaE 
30 with at least one other polypeptide, particularly a GBS polypeptide, more particularly 
wherein said other GBS polypeptide is selected from the group of Spbl, Spb2, C 
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protein alpha antigen, Rib, Lmb, C5a-ase, and immunogenic polypeptide fragments 
thereof. 

The invention further relates to a vaccine for protection of an animal subject from 
5 infection with a streptococcal bacterium comprising a vector containing a gene 
encoding an Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE operatively associated with a promoter capable of directing 
expression of the gene in the subject. The present invention further provides a nucleic 
acid vaccine comprising a recombinant DNA molecule capable of encoding a GBS 
10 polypeptide EmaA, EmaB, EmaC, EmaD or EmaE. 

The invention further relates to a vaccine for protection of an animal subject from 
infection with a Group B streptococcal bacterium comprising a vector containing a 
gene encoding an Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
15 EmaD and EmaE operatively associated with a promoter capable of directing 

expression of the gene in the subject. The present invention further provides a nucleic 
acid vaccine comprising a recombinant DNA molecule capable of encoding a GBS 
polypeptide EmaA, EmaB, EmaC, EmaD or EmaE. 

20 The present invention provides a vector which comprises the nucleic acid capable of 
encoding encoding an Ema polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE and a promoter. The present invention provides a vector 
which comprises the nucleic acid of any of SEQ ID NO: 1, 3, 5, 7 or 9 and a 
promoter. The invention contemplates a vector wherein the promoter comprises a 

25 bacterial, yeast, insect or mammalian promoter. The invention contemplates a vector 
wherein the vector is a plasmid, cosmid, yeast artificial chromosome (YAC), 
bacteriophage or eukaryotic viral DNA. 

The present invention further provides a host vector system for the production of a 
30 polypeptide which comprises the vector capable of encoding an Ema polypeptide, 
particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE in a 



WO 02/12294 



PCT/US01/24795 



18 

suitable host cell. A host vector system is provided wherein the suitable host cell 
comprises a prokaryotic or eukaryotic cell. A unicellular host transformed with a 
recombinant DNA molecule or vector capable of encoding encoding an Ema 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE is 
5 thereby provided. 

The present invention includes methods for determining and monitoring infection by 
streptococci by detecting the presence of a streptococcal polypeptide selected from the 
group of EmaA, EmaB, EmaC, EmaD and EmaE. In a particular such method, the 
10 streptococcal Ema polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a 
Streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE is suspected with an antibody to the said 
streptococcal polypeptide under conditions that allow binding of the 

15 streptococcal polypeptide to the antibody to occur; and 

b. detecting whether binding has occurred between the streptococcal 
polypeptide from the sample and the antibody; 

> 

20 wherein the detection of binding indicates the presence or activity of the streptococcal 
polypeptide in the sample. 

The present invention includes methods for determining and monitoring infection by 
streptococci by detecting the presence of a streptococcal polypeptide selected from the 
25 group of EmaA, EmaB, EmaC, EmaD and EmaE. In a particular such method, the 
streptococcal Ema polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a 

Streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE is suspected with an antibody to the said 
30 streptococcal polypeptide under conditions that allow binding of the 

streptococcal polypeptide to the antibody to occur; and 
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b. detecting whether binding has occurred between the streptococcal 
polypeptide from the sample and the antibody; 

wherein the detection of binding indicates the presence or activity of the 
5 streptococcal polypeptide in the sample. 

The present invention includes methods for determining and monitoring infection by 
Group B streptococci by detecting the presence of a Group B streptococcal 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. In a 
10 particular such method, the streptococcal Ema polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a Group B 
streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE is suspected with an antibody to the said 
Group B streptococcal polypeptide under conditions that allow binding 

15 of the Group B streptococcal polypeptide to the antibody to occur; and 

b. detecting whether binding has occurred between the Group B 
streptococcal polypeptide from the sample and the antibody; 

20 

wherein the detection of binding indicates the presence or activity of the Group B 
streptococcal polypeptide in the sample. 

The present invention further provides a method for detecting the presence of a 
25 bacterium having a gene encoding a streptococcal polypeptide selected from the group 
of emaA, emaB, emaC, emaD and emaE, comprising: 

a. contacting a sample in which the presence or activity of the bacterium 
is suspected with an oligonucleotide which hybridizes to a 
streptococcal polypeptide gene selected from the group of emaA, 
30 emaB, emaC, emaD and emaE, under conditions that allow specific 

hybridization of the oligonucleotide to the gene to occur; and 
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b. detecting whether hybridization has occurred between the 
oligonucleotide and the gene; 

wherein the detection of hybridization indicates that presence or activity of the 
5 bacterium in the sample. 

The invention includes an assay system for screening of potential compounds effective 
to modulate the activity of a streptococcal protein EmaA, EmaB, EmaC, EmaD or 
EmaE of the present invention. In one instance, the test compound, or an extract 

10 containing the compound, could be administered to a cellular sample expressing the 
particular Ema protein to determine the compound's effect upon the activity of the 
protein by comparison with a control. In a further instance the test compound, or an 
extract containing the compound, could be administered to a cellular sample 
expressing the Ema protein to determine the compound's effect upon the activity of 

15 the protein, and thereby on adherence of said cellular sample to host cells, by 
comparison with a control. 

It is still a further object of the present invention to provide a method for the 
prevention or treatment of mammals to control the amount or activity of streptococci, 
20 so as to treat or prevent the adverse consequences of invasive, spontaneous, or 
idiopathic pathological states. 

It is still a further object of the present invention to provide a method for the 
prevention or treatment of mammals to control the amount or activity of Group B 
streptococci, so as to treat or prevent the adverse consequences of invasive, 
25 spontaneous, or idiopathic pathological states. 

The invention provides a method for preventing infection with a bacterium that 
expresses a streptococcal Ema polypeptide comprising administering an 
immunogenically effective dose of a vaccine comprising an Ema polypeptide selected 
30 from the group of EmaA, EmaB, EmaC, EmaD and EmaE to a subject. 
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The invention further provides a method for preventing infection with a bacterium that 
expresses a Group B streptococcal Ema polypeptide comprising administering an 
immunogenically effective dose of a vaccine comprising an Ema polypeptide selected 
from the group of Ema A, EmaB, EmaC, EmaD and EmaE to a subject. 
5 . 

The present invention is directed to a method for treating infection with a bacterium 
that expresses a streptococcal Ema polypeptide comprising administering a 
therapeutically effective dose of a pharmaceutical composition comprising an Ema 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and a 
10 pharmaceutically acceptable carrier to a subject. 

The invention further provides a method for treating infection with a bacterium that 
expresses a streptococcal Ema polypeptide comprising administering a therapeutically 
effective dose of a pharmaceutical composition comprising an antibody to an Ema 
15 polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and a 
pharmaceutically acceptable carrier to a subject. 

In a further aspect, the invention provides a method of inducing an immune response 
in a subject which has been exposed to or infected with a streptococcal bacterium 
20 comprising administering to the subject an amount of the pharmaceutical composition 
comprising an Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable carrier, thereby inducing an 
immune response. 

25 The invention still further provides a method for preventing infection by a 

streptococcal bacterium in a subject comprising administering to the subject an amount 
of a pharmaceutical composition comprising an antibody to an Ema polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE and a 
pharmaceutically acceptable carrier or diluent, thereby preventing infection by a 

30 streptococcal bacterium. 
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In a further aspect, the invention provides a method of inducing an immune response 
in a subject which has been exposed to or infected with a Group B streptococcal 
bacterium comprising administering to the subject an amount of the pharmaceutical 
composition comprising an Ema polypeptide selected from the group of EmaA, EmaB, 
5 EmaC, EmaD and EmaE, and a pharmaceutically acceptable carrier, thereby inducing 
an immune response. 

The invention still further provides a method for preventing infection by a Group B 
streptococcal bacterium in a subject comprising administering to the subject an amount 
10 of a pharmaceutical composition comprising an antibody to an Ema polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE and a 
pharmaceutically acceptable carrier or diluent, thereby preventing infection by a 
streptococcal bacterium. 

15 The invention further provides an ema mutant bacteria which is non-adherent and/or 
non-invasive to cells, particularly which is mutated in one or more genes selected from 
the group of ema A, emaB, emaC, emaD and emaE. Particularly, such ema mutant is a 
streptococcal bacteria. More particularly, such ema mutant is a Group B 
streptococcal bacteria. Such non-adherent and/or non-invasive ema mutant bacteria 

20 can further be utilized in expressing other immunogenic or therapeutic proteins for the 
purposes of eliciting immune responses to any such other proteins in the context of 
vaccines and in other forms of therapy. 

Other objects and advantages will become apparent to those skilled in the art from a 
25 review of the following description which proceeds with reference to the following 
illustrative drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 



30 



FIGURE 1 depicts the restriction digest pattern (RDP) type III-3 specific probes. 
Dot blot hybridization of probe DY1-1 with genomic DNA isolated from type III 
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GBS. 10 ug of genomic DNA from each of 62 type III GBS strains was transferred to 
nylon membrane. Radiolabeled probe DY1-1 hybridized with DNA from all III-3 
strains (rows A-D) including the original type III-3 strain (well E-l). The probe failed 
to hybridize with DNA from III-2 strains (Fl- F10, Gl-7) including the original strain 
5 used in the subtraction hybridization (well E 10) and III-l strains (wells HI -3; cf. 
Figure 3). The same pattern of hybridization was observed using probe DY1-1 1 . 

* 

FIGURE 2 depicts the nucleic acid and predicted amino acid sequence of emaA . 
10 FIGURE 3 depicts the nucleic acid ^nd predicted amino acid sequence of emaB. 
FIGURE 4 depicts the nucleic acid and predicted amino acid sequence of emaC. 
FIGURE 5 depicts the nucleic acid and predicted amino acid sequence of emaD. 

15 

FIGURE 6 A-D depicts the nucleic acid and predicted amino acid sequence of emaE. 

DETAILED DESCRIPTION 

20 The present invention provides novel Group B streptococcal Ema polypeptides and 
their Ema homologs in distinct bacterial species, including distinct streptococcal 
species. The present invention relates to novel streptococcal Ema polypeptides, 
particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and 
fragments thereof. Nucleic acids encoding Ema polypeptides, and diagnostic and 

25 therapeutic compositions and methods based thereon for identification and prevention 
of infections by virulent forms of streptococci are provided. In particular, the present 
invention includes Group B streptococcal Ema polypeptides. The invention further 
includes polypeptide homologs of the GBS Ema polypeptides, particularly 
streptococcal homologs, more particularly Ema homologs of S. pneumoniae and S. 

30 pyogenes. Bacterial Ema polypeptide homologs in E. faecalis and C diptheriae are 
also provided. 
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Polypeptides 

The present invention is directed to an isolated polypeptide comprising an amino acid 
5 sequence of a bacterial Ema polypeptide. Bacterial Ema polypepties are provided 
from streptococcus, enterococcus and corynebacterium. The present invention is 
particularly directed to an isolated polypeptide comprising an amino acid sequence of a 
streptococcal Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE. The present invention is particularly directed to an isolated 
10 polypeptide comprising an amino acid sequence of a Group streptococcal Ema 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. 
Additional S. pneumoniae and S. pyogenes Ema polypeptides are included in the 
invention. E. faecalis and C. diptheriae Ema polypeptides are also included in the 
invention. 

15 

The polypeptides of the present invention are suitable for use in immunizing animals 
broadly against streptococcal infection. The polypeptides of the present invention are 
suitable for use in immunizing animals broadly against Group B, Group A, and S. 
pneumoniae streptococcal infection. The polypeptides of the present invention are 
20 suitable for use in immunizing animals against Group B streptococci. These 
polypeptide or peptide fragments thereof, when formulated with an appropriate 
adjuvant, are used in vaccines for protection against streptococci, particularly Group B 
streptococci, and against other bacteria with cross-reactive proteins. 

25 GBS proteins with streptococcal homologs outside of Group B have been previously 
identified (Lachenauer CS andMadoff LC ( 1 997) Adv Exp Med Biol. 418:615-8; 
Brady L.J. et al (1991) Infect Immun 59(12):4425-35; Stahlhammer-Carlemalm M. et 
al (2000) J Infect Dis 182(1):142-129). Stahlhammer-Carlemalm et al have 
demonstrated cross-protection between Group A and Group B streptococci due to 

30 cross-reacting surface proteins (Stahlhammer-Carlemalm M. et al (2000) J Infect Dis 
182(1): 142-129). The R28 protein of group A streptococcus (GAS) and the Rib 
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protein of group B streptococcus (GBS) are surface molecules that elicit protective 
immunity to experimental infection. These proteins are members of the same family 
and cross-react immunologically. In spite of extensive amino acid residue identity, the 
cross-reactivity between R28 and Rib was found to be limited, as shown by analysis 
5 with highly purified proteins and specific antisera. Nevertheless, immunization of mice 
with purified R28 conferred protection against lethal infection with Rib-expressing 
GBS strains, and immunization with Rib conferred protection against R28-expressing 
GAS. Thus, R28 and Rib elicited cross-protective immunity. 

10 The present invention is directed to an isolated streptococcal EmaA polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 2, and analogs, variants 
and immunogenic fragments thereof. 

The present invention is directed to an isolated streptococcal EmaB polypeptide which 
15 comprises the amino acid sequence set out in SEQ ID NO: 4, and analogs, variants 
and immunogenic fragments thereof. 

The present invention is directed to an isolated streptococcal EmaC polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 6, and analogs, variants 
20 and immunogenic fragments thereof 

The present invention is directed to an isolated streptococcal EmaD polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 8, and analogs, variants 
and immunogenic fragments thereof. 

25 

The identity or location of one or more amino acid residues may be changed or 
modified to include variants such as, for example, deletions containing less than all of 
the residues specified for the protein, substitutions wherein one or more residues 
specified are replaced by other residues and additions wherein one or more amino acid 
30 residues are added to a terminal or medial portion of the polypeptide. These 

molecules include: the incorporation of codons "preferred" for expression by selected 
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non-mammalian hosts; the provision of sites for cleavage by restriction endonuclease 
enzymes; and the provision of additional initial, terminal or intermediate DNA 
sequences that facilitate construction of readily expressed vectors. 

5 The present invention is directed to an isolated Group B streptococcal EmaE 

polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 10, and 
analogs, variants and immunogenic fragments thereof. 

The present invention thus provides an isolated streptococcal Ema polypeptide 
10 comprising the amino acid sequence, set out in SEQ ID NO:23. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:23 is further 
provided. 

The invention thus further provides an isolated streptococcal Ema polypeptide 
15 comprising the amino acid sequence set out in SEQ ID NO:26. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID N0.26 is further 
provided. 

The present invention further provides an isolated streptococcal Ema polypeptide 
20 comprising the amino acid sequence set out in SEQ ID NO: 3 7. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:37 is further 
provided. 

An enterococcal Ema polypeptide is further provided comprising the amino acid 
25 sequence set out in SEQ ID NO:29. An isolated isolated nucleic acid which encodes 
the enterococcal polypeptide set out in SEQ ID NO:29 is also provided. 

The invention provides an isolated Corynebacterhim Ema polypeptide comprising the 
amino acid sequence set out in SEQ ID NO: 32. Also provided is an isolated nucleic 
30 acid which encodes the Corynebacterium polypeptide set out in SEQ ID NO: 32. 
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The invention provides an isolated bacterial polypeptide comprising the amino acid 
sequence TLLTCTPYMTNS/TEniLLVRyKG (SEQ ID NO: 34), wherein the 
polypeptide is not isolated from Actinomyces. 

5 The invention further provides an isolated streptococcal polypeptide comprising the 
amino acid sequence TLLTCTPYMINS/THRLLVR/KG (SEQ ED NO: 34). 

Also provided is an isolated bacterial polypeptide comprising the amino acid sequence 
TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35). 

10 

« 

The present invention includes an isolated bacterial polypeptide comprising the amino 
acid sequence TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36). An isolated 
streptococcal polypeptide comprising the amino acid sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36) is also provided. 

15 

The invention further includes an isolated polypeptide having the amino acid sequence 
selected from the group of TLLTCTP YM1NS/THRLLVR/KG (SEQ ID NO: 34), 
TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35), and TLVTCTPYGVNTKRLLVRG 
(SEQ ID NO: 36). 

20 

The present invention contemplates the use of the streptococcal polypeptides of the 
present invention in diagnostic tests and methods for determining and/or monitoring of 
streptococcal infection. Thus, the present invention provides an isolated GBS Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
25 EmaE, labeled with a detectable label. 

In the instance where a radioactive label, such as the isotopes 3 H, 14 C, 32 P, 35 S, 36 C1, 
51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, l25 I, l31 I, and 186 Re are used, known currently available 
counting procedures may be utilized. In the instance where the label is an enzyme, 
30 detection may be accomplished by any of the presently utilized colorimetric, 
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spectrophotometries flu or o spectrophotometries, amperometric or gasometric 
techniques known in the art. 

The present invention extends to an immunogenic bacterial Ema polypeptide. The 
5 present invention extends to an immunogenic streptococcal Ema polypeptide, 

particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, or a 
fragment thereof The present invention also extends to immunogenic GBS Ema 
polypeptides wherein such polypeptides comprise a combination of at least one 
immunogenic GBS Ema polypeptide, selected from the group of EmaA, EmaB, EmaC, 
10 EmaD and EmaE, or immunogenic polypeptide fragment thereof and GBS polypeptide 
Spbl, Spb2, C protein alpha antigen, Rib or immunogenic fragments thereof. 

As defined herein, "adhesion" means noncovalent binding of a bacteria to a human cell 
or secretion that is stable enough to withstand washing. 

15 

The term "extracellular matrix adhesin", "Ema", "ema" and any variants not specifically 
listed, may be used herein interchangeably, and as used throughout the present 
application and claims refer to proteinaceous material including single or multiple 
proteins, and extends to those proteins having the amino acid sequence data described 

20 herein and particularly identified by (SEQ ID NOS: 2, 4, 6, 8, 10, 23, 26, 29, 32 and 
37), and the profile of activities set forth herein and in the Claims. In particular the 
Ema proteins provided herein include EmaA, EmaB, EmaC, EmaD and EmaE. The 
Ema proteins include bacterial Ema homologs. Bacterial Ema homologs include those 
from streptococcal species and other bacterial species. Accordingly, proteins and 

25 polypeptides displaying substantially equivalent or altered activity are likewise 
contemplated. These modifications may be deliberate, for example, such as 
modifications obtained through site-directed mutagenesis, or may be accidental, such 
as those obtained through mutations in hosts that are producers of one or more Ema 
polypeptide. Also, the term "extracellular matrix adhesin (Ema)" is intended to include 

30 within its scope proteins specifically recited herein as well as all substantially 
homologous analogs and allelic variations. 
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This invention provides an isolated immunogenic polypeptide comprising an amino 
acid sequence of a bacterial Ema polypeptide. This invention provides an isolated 
immunogenic polypeptide comprising an amino acid sequence of a streptococcal Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
5 EmaE. It is contemplated by this invention that the immunogenic polypeptide has the 
amino acid sequence set forth in any of SEQ ID NOS: 2, 4, 6, 8, 10, 23, 26, 29, 32 
and 37, including immunogenic fragments, mutants, variants, analogs, or derivatives, 
thereof. 

10 This invention is directed to analogs of the polypeptide which comprise the amino acid 
sequence as set forth above. The analog polypeptide may have an N-terminal 
methionine or a polyhistidine optionally attached to the N or COOH terminus of the 
polypeptide which comprise the amino acid sequence. 

15 In another embodiment, this invention contemplates peptide fragments of the 

polypeptide which result from proteolytic digestion products of the polypeptide. In 
another embodiment, the derivative of the polypeptide has one or more chemical 
moieties attached thereto. In another embodiment the chemical moiety is a water 
soluble polymer. In another embodiment the chemical moiety is polyethylene glycol. 

20 In another embodiment the chemical moiety is mon-, di-, tri- or tetrapegylated. In 
another embodiment the chemical moiety is N-terminal monopegylated. 

! 

Attachment of polyethylene glycol (PEG) to compounds is particularly useful because 
PEG has very low toxicity in mammals (Carpenter et al, 1971). For example, a PEG 

25 adduct of adenosine deaminase was approved in the United States for use in humans 
for the treatment of severe combined immunodeficiency syndrome. A second 
advantage afforded by the conjugation of PEG is that of effectively reducing the 
immunogenicty and antigenicity of heterologous compounds. For example, a PEG 
adduct of a human protein might be useful for the treatment of disease in other 

30 mammalian species without the risk of triggering a severe immune response. The 
compound of the present invention may be delivered in a microencapsulation device 
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so as to reduce or prevent an host immune response against the compound or against 
cells which may produce the compound. The compound of the present invention may 
also be delivered microencapsulated in a membrane, such as a liposome. 

5 Numerous activated forms of PEG suitable for direct reaction with proteins have been 
described. Useful PEG reagents for reaction with protein amino groups include active 
esters of carboxylic acid or carbonate derivatives, particularly those in which the 
leaving groups are N-hydroxysuccinimide, p-nitrophenol, imidazole or l-hydroxy-2- 
nitrobenzene-4-sulfonate. PEG derivatives containing maleimido or haloacetyl groups 
10 are useful reagents for the modification of protein free sulfhydryl groups. Likewise, 
PEG reagents containing amino hydrazine or hydrazide groups are useful for reaction 
with aldehydes generated by periodate oxidation of carbohydrate groups in proteins. 

In one embodiment, the amino acid residues of the polypeptide described herein are 
15 preferred to be in the M L" isomeric form. In another embodiment, the residues in the 
"D" isomeric form can be substituted for any L-amino acid residue, as long as the 
desired functional property of lectin activity is retained by the polypeptide. NH 2 refers 
to the free amino group present at the amino terminus of a polypeptide. COOH refers 
to the free carboxy group present at the carboxy terminus of a polypeptide. 
20 Abbreviations used herein are in keeping with standard polypeptide nomenclature, J. 
Biol Chem., 243:3552-59 (1969). 

It should be noted that all amino-acid residue sequences are represented herein by 
formulae whose left and right orientation is in the conventional direction of amino- 
25 terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the 
beginning or end of an amino acid residue sequence indicates a peptide bond to a 
further sequence of one or more amino-acid residues. 

Synthetic polypeptide, prepared using the well known techniques of solid phase, liquid 
30 phase, or peptide condensation techniques, or any combination thereof, can include 
natural and unnatural amino acids. Amino acids used for peptide synthesis may be 



WO 02/12294 



PCT/US01/24795 



31 

standard Boc (N a -amino protected N a -t-butyloxycarbonyl) amino acid resin with the 
standard deprotecting, neutralization, coupling and wash protocols of the original solid 
phase procedure of Merrifield (1963, J. Am. Chem. Soc. 85:2149-2154), or the base- 
labile N a -amino protected 9-fluorenylmethoxycarbonyl (Fmoc) amino acids first 
5 described by Carpino and Han (1972, J. Org. Chem. 37:3403-3409). Thus, 

polypeptide of the invention may comprise D-amino acids, a combination of D- and L- 
amino acids, and various "designer" amino acids (e.g., p-methyl amino acids, Ca- 
methyl amino acids, and Na-methyl amino acids, etc.) to convey special properties. 
Synthetic amino acids include ornithine for lysine, fluorophenylalanine for 
10 phenylalanine, and norleucine for leucine or isoleucine. Additionally, by assigning 
specific amino acids at specific coupling steps, cc-helices, P turns, P sheets, y -turns, 
and cyclic peptides can be generated. 

In one aspect of the invention, the peptides may comprise a special amino acid at the 
15 C-terminus which incorporates either a C0 2 H or CONH 2 side chain to simulate a free 
glycine or a glycine-amide group. Another way to consider this special residue would 
be as a D or L amino acid analog with a side chain consisting of the linker or bond to 
the bead. In one embodiment, the pseudo-free C-terminal residue may be of the D or 
the L optical configuration; in another embodiment, a racemic mixture of D and L- 
20 isomers may be used. 

In an additional embodiment, pyroglutamate may be included as the N-terminal residue 
of the peptide. Although pyroglutamate is not amenable to sequence by Edman 
degradation, by limiting substitution to only 50% of the peptides on a given bead with 

25 N-terminal pyroglutamate, there will remain enough non-pyroglutamate peptide on the 
bead for sequencing. One of ordinary skill would readily recognize that this technique 
could be used for sequencing of any peptide that incorporates a residue resistant to 
Edman degradation at the N-terminus. Other methods to characterize individual 
peptides that demonstrate desired activity are described in detail infra. Specific 

30 activity of a peptide that comprises a blocked N-terminal group, e.g., pyroglutamate, 
when the particular N-terminal group is present in 50% of the peptides, would readily 



WO 02/12294 



PCT/US01/24795 



32 

be demonstrated by comparing activity of a completely (100%) blocked peptide with 
non-blocked (0%) peptide. 



In addition, the present invention envisions preparing peptides that have more well 
5 defined structural properties, and the use of peptidomimetics, and peptidomimetic 
bonds, such as ester bonds, to prepare peptides with novel properties. In another 
embodiment, a peptide may be generated that incorporates a reduced peptide bond, 
i.e., R r CH 2 -NH-R 2 , where R t and R 2 are amino acid residues or sequences. A 
reduced peptide bond may be introduced as a dipeptide subunit. Such a molecule 

10 would be resistant to peptide bond hydrolysis, e.g., protease activity. Such peptides 
would provide ligands with unique function and activity, such as extended half-lives in 
vivo due to resistance to metabolic breakdown, or protease activity. Furthermore, it is 
well known that in certain systems constrained peptides show enhanced functional 
activity (Hruby, 1982, Life Sciences 31:189-199; Hruby et al., 1990, Biochem J. 

15 268:249-262); the present invention provides a method to produce a constrained 
peptide that incorporates random sequences at all other positions. 



A constrained, cyclic or rigidized peptide may be prepared synthetically, provided that 
in at least two positions in the sequence of the peptide an amino acid or amino acid 

20 analog is inserted that provides a chemical functional group capable of cross-linking to 
constrain, cyclise or rigidize the peptide after treatment to form the cross-link. 
Cyclization will be favored when a turn-inducing amino acid is incorporated. 
Examples of amino acids capable of cross-linking a peptide are cysteine to form 
disulfide, aspartic acid to form a lactone or a lactase, and a chelator such as 

25 y- carb oxyl-glutamic acid (Gla) (Bachem) to chelate a transition metal and form a 
cross-link. Protected y-carboxyl glutamic acid may be prepared by modifying the 
synthesis described by Zee-Cheng and Olson (1980, Biophys. Biochem. Res. Commim. 
94: 1 128-1 132). A peptide in which the peptide sequence comprises at least two 
amino acids capable of cross-linking may be treated, e.g., by oxidation of cysteine 

30 residues to form a disulfide or addition of a metal ion to form a chelate, so as to cross- 
link the peptide and form a constrained, cyclic or rigidized peptide. 
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The present invention provides strategies to systematically prepare cross-links. For 

example, if four cysteine residues are incorporated in the peptide sequence, different 

protecting groups may be used (Hiskey, 1981, in The Peptides: Analysis, Synthesis, 

5 Biology, Vol. 3, Gross and Meienhofer, eds., Academic Press: New York, pp. 137- 

167; Ponsanti et al., 1990, Tetrahedron 46:8255-8266). The first pair of cysteine may 

be deprotected and oxidized, then the second set may be deprotected and oxidized. In 

this way a defined set of disulfide cross-links may be formed. Alternatively, a pair of 

cysteine and a pair of collating amino acid analogs may be incorporated so that the 

10 cross-links are of a different chemical nature. 

« 

The following non-classical amino acids may be incorporated in the peptide in order to 

* 

introduce particular conformational motifs: 1 ,2,3 ,4-tetrahydroisoquinoline-3 - 
carboxylate (Kazmierski et al, 1991, J. Am. Chem. Soc. 113:2275-2283); (2S,3S)- 

15 methyl-phenylalanine, (2S,3R)-methyl-phenylalanine, (2R,3S)-methyl-phenylalanine 
and (2R,3R)-methyl-phenylalanine (Kazmierski and Hruby, 1991, Tetrahedron Lett.); 
2-aminotetrahydronaphthalene~2-carboxylic acid (Landis, 1989, Ph.D. Thesis, 
University of Arizona); hydroxy-l,2,3,4-tetrahydroisoquinoline-3-carboxylate (Miyake 
et al, 1989, J. Takeda Res. Labs. 43:53-76); p~carboline (D and L) (Kazmierski, 

20 1988, Ph.D. Thesis, University of Arizona); HtC (histidine isoquinoline carboxylic 

acid) (Zechel et al., 1991, Int. J. Pep. Protein Res. 43); and HIC (histidine cyclic urea) 
(Dharanipragada) . 

The following amino acid analogs and peptidomimetics may be incorporated into a 
25 peptide to induce or favor specific secondary structures: LL-Acp (LL-3-amino- 
2-propenidone-6-carboxylic acid), a P -turn inducing dipeptide analog (Kemp et al., 
1985, J. Org. Chem. 50:5834-5838); P-sheet inducing analogs (Kemp et al., 1988, 
Tetrahedron Lett. 29:5081-5082); P-turn inducing analogs (Kemp et al, 1988, 
Tetrahedron Lett. 29:5057-5060); «-helix inducing analogs (Kemp et al, 1988, 
30 Tetrahedron Lett. 29:4935-4938); y-turn inducing analogs (Kemp et al, 1989, J. Org. 
Chem. 54: 109: 115); and analogs provided by the following references: Nagai and 



WO 02/12294 PCT/US01/24795 

34 

Sato., 1985, Tetrahedron Lett. 26:647-650; DiMaio et al., 1989, J, Chem, Soc. Perkin 
Trans, p. 1687; also a Gly-Ala turn analog (Kahn et al., 1989, Tetrahedron Lett. 
30:2317); amide bond isostere (Jones et aL, 1988, Tetrahedron Lett. 29:3853-3856); 
tretrazol (Zabrocki et al, 1988, J. Am. Chem. Soc. 1 10:5875-5880); DTC (Samanen 
5 et al., 1990, Int. J. Protein Pep. Res. 35:501 :509); and analogs taught in Olson et al, 
1990, J. Am. Chem. Sci. 112:323-333 and Garvey et al., 1990, J. Org. Chem. 56:436. 
Conformationally restricted mimetics of beta turns and beta bulges, and peptides 
containing them, are described in U.S. Patent No. 5,440,013, issued August 8, 1995 to 
Kahn. 

10 

The present invention further provides for modification or derivatization of the 
polypeptide or peptide of the invention. Modifications of peptides are well known to 
one of ordinary skill, and include phosphorylation, carboxymethylation, and acylation. 
Modifications may be effected by chemical or enzymatic means. In another aspect, 

15 glycosylated or fatty acylated peptide derivatives may be prepared. Preparation of 
glycosylated or fatty acylated peptides is well known in the art. Fatty acyl peptide 
derivatives may also be prepared. For example, and not by way of limitation, a free 
amino group (N-terminal or lysyl) may be acylated, e.g., myristoylated. In another 
embodiment an amino acid comprising an aliphatic side chain of the structure - 

20 (CH 2 ) n CH 3 may be incorporated in the peptide. This and other peptide-fatty acid 
conjugates suitable for use in the present invention are disclosed in U.K. Patent GB- 
8 809 1 62.4, International Patent Application PCT/AU89/00 1 66, and reference 5, 
supra. 

25 Chemical Moieties For Derivatization. Chemical moieties suitable for derivatization 
may be selected from among water soluble polymers. The polymer selected should be 
water soluble so that the component to which it is attached does not precipitate in an 
aqueous environment, such as a physiological environment. Preferably, for therapeutic 
use of the end-product preparation, the polymer will be pharmaceutical^ acceptable. 

30 One skilled in the art will be able to select the desired polymer based on such 
considerations as whether the polymer/component conjugate will be used 
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therapeutically, and if so, the desired dosage, circulation time, resistance to 
proteolysis, and other considerations. For the present component or components, 
these may be ascertained using the assays provided herein. 

5 The water soluble polymer may be selected from the group consisting of, for example, 
polyethylene glycol, copolymers of ethylene glycol/propylene glycol, 
carboxymethylcellulose, dextran, polyvinyl alcohol, polyvinyl pyrrolidone, poly-1, 
3-dioxolane, poly-1, 3, 6-trioxane, ethylene/maleic anhydride copolymer, 
polyaminoacids (either homopolymers or random copolymers), and dextran or 
10 poly(n- vinyl pyrrolidone)polyethylene glycol, propropylene glycol homopolymers, 
prolypropylene oxide/ethylene oxide co- polymers, polyoxyethylated polyols and 
polyvinyl alcohol. Polyethylene glycol propionaldenhyde may have advantages in 
manufacturing due to its stability in water. 

15 The polymer may be of any molecular weight, and may be branched or unbranched. 
For polyethylene glycol, the preferred molecular weight is between about 2kDa and 
about lOOkDa (the term "about" indicating that in preparations of polyethylene glycol, 
some molecules will weigh more, some less, than the stated molecular weight) for ease 
in handling and manufacturing. Other sizes may be used, depending on the desired 

20 therapeutic profile (e.g., the duratibn of sustained release desired, the effects, if any 
on biological activity, the ease in handling, the degree or lack of antigenicity and other 
known effects of the polyethylene glycol to a therapeutic protein or analog). 

The number of polymer molecules so attached may vary, and one skilled in the art will 
25 be able to ascertain the effect on function. One may mono-derivative, or may provide 
for a di-, tri-, tetra- or some combination of derivatization, with the same or different 
chemical moieties (e.g., polymers, such as different weights of polyethylene glycols). 
The proportion of polymer molecules to component or components molecules will 
vary, as will their concentrations in the reaction mixture. In general, the optimum ratio 
30 (in terms of efficiency of reaction in that there is no excess unreacted component or 
components and polymer) will be determined by factors such as the desired degree of 
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derivatization (e.g., mono, di-, tri-, etc.), the molecular weight of the polymer selected, 
whether the polymer is branched or unbranched, and the reaction conditions. 

The polyethylene glycol molecules (or other chemical moieties) should be attached to 
5 the component or components with consideration of effects on functional or antigenic 
domains of the protein. There are a number of attachment methods available to those 
skilled in the art, e.g., EP 0 401 384 herein incorporated by reference (coupling PEG 
to G-CSF), see also Malik etal, 1992, Exp. Hematol 20:1028-1035 (reporting 
pegylation of GM-CSF using tresyl chloride). For example, polyethylene glycol may 

10 be covalently bound through amino acid residues via a reactive group, such as, a free 
amino or carboxyl group. Reactive groups are those to which an activated 
polyethylene glycol molecule may be bound. The amino acid residues having a free 
amino group include lysine residues and the - terminal amino acid residues; those 
having a free carboxyl group include aspartic acid residues glutamic acid residues and 

15 the C-terminal amino acid residue. Sulfhydrl groups may also be used as a reactive 
group for attaching the polyethylene glycol molecule(s). Preferred for therapeutic 
purposes is attachment at an amino group, such as attachment at the N-terminus or 
lysine group. 

20 Nucleic Acids 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of 
the art. Such techniques are explained fully in the literature. See, e.g., Sambrook et 

25 al, "Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in 

Molecular Biology" Volumes Mil [Ausubel, R. M., ed. (1994)]; "Cell Biology: A 
Laboratory Handbook" Volumes I-III [J. E. Celis, ed. (1994))]; "Current Protocols in 
Immunology" Volumes I-III [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" 
(M.J. Gait ed. 1984); "Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. 

30 (1985)]; "Transcription And Translation" [B.D. Hames & S.J. Higgins, eds. (1984)]; 



* 
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"Animal Cell Culture" [R.I. Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" 
[IRL Press, (1986)]; B. Perbal, "A Practical Guide To Molecular Cloning" (1984). 

Mutations can be made in a nucleic acid encoding the polypeptide of the present 
5 invention such that a particular codon is changed to a codon which codes for a 
different amino acid. Such a mutation is generally made by making the fewest 
nucleotide changes possible. A substitution mutation of this sort can be made to 
change an amino acid in the resulting protein in a non-conservative manner (i.e., by 
changing the codon from an amino acid belonging to a grouping of amino acids having 
10 a particular size or characteristic to an amino acid belonging to another grouping) or in 
a conservative manner (i.e., by changing the codon from an amino acid belonging to a 
grouping of amino acids having a particular size or characteristic to an amino acid 
belonging to the same grouping). Such a conservative change generally leads to less 
change in the structure and function of the resulting protein. A non-conservative 
15 change is more likely to alter the structure, activity or function of the resulting protein. 
The present invention should be considered to include sequences containing 
conservative changes which do not significantly alter the activity or binding 
characteristics of the resulting protein. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid 
20 belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, 

leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine. Amino 
acids containing aromatic ring structures are phenylalanine, tryptophan, and tyrosine. 
The polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, 
asparagine, and glutamine. The positively charged (basic) amino acids include 
25 arginine, lysine and histidine. The negatively charged (acidic) amino acids include 
aspartic acid and glutamic acid. Such alterations will not be expected to affect 
apparent molecular weight as determined by polyacrylamide gel electrophoresis, or 
isoelectric point. 

30 Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 
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- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free NH 2 can be maintained. 

5 Synthetic DNA sequences allow convenient construction of genes which will express 
analogs or "muteins". A general method for site-specific incorporation of unnatural 
amino acids into proteins is described inNoren, et al. Science, 244:182-188 (April 
1989). This method may be used to create analogs with unnatural amino acids. 

10 This invention provides an isolated nucleic acid encoding a polypeptide comprising an 
amino acid sequence of a streptococcal Ema polypeptide. This invention provides an 
isolated nucleic acid encoding a polypeptide comprising an amino acid sequence of a 
streptococcal Ema polypeptide. This invention provides an isolated nucleic acid 
encoding a polypeptide comprising an amino acid sequence of a Group B 

15 streptococcal Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE. This invention provides an isolated nucleic acid encoding a 
polypeptide comprising an amino acid sequence of a Group B streptococcal Ema 
protein selected from the group of Ema proteins EmA, EmaB, EmaC. EmaD and 
EmaE as set forth in FIGURES 2-6. The invention provides an isolated nucleic acid 

20 encoding a polypeptide comprising an amino acid sequence of a bacterial Ema 
polypeptide selected from the group of SEQ ID NO: 23, 26, 29, 32 and 37. In 
particular embodiments the nucleic acid is set forth in any of SEQ ID NOS: 1, 3, 5, 7, 
9, 24, 27, 30, and 33, including fragments, mutants, variants, analogs, or derivatives, 
thereof The nucleic acid is DNA, cDNA, genomic DNA, RNA. Further, the isolated 

25 nucleic acid may be operatively linked to a promoter of RNA transcription. 

The present invention also relates to isolated nucleic acids, such as recombinant DNA 
molecules or cloned genes, or degenerate variants thereof, mutants, analogs, or 
fragments thereof, which encode the isolated polypeptide or which competitively 
30 inhibit the activity of the polypeptide. The present invention further relates to isolated 
nucleic acids, such as recombinant DNA molecules or cloned genes, or degenerate 
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variants thereof, mutants, analogs, or fragments thereof, which encode a GBS Ema 
polypeptide, particularly selected from the group of Ema A, EmaB, EmaC, EmaD and 
EmaE. Preferably, the isolated nucleic acid, which includes degenerates, variants, 
mutants, analogs, or fragments thereof, has a sequence as set forth in SEQ ID NOS: 1, 
5 3, 5, 7 or 9. In a further embodiment of the invention, the DNA sequence of the 

recombinant DNA molecule or cloned gene may be operatively linked to an expression 
control sequence which may be introduced into an appropriate host. The invention 
accordingly extends to unicellular hosts transformed with the cloned gene or 
recombinant DNA molecule comprising a DNA sequence encoding an Ema protein, 
10 particularly selected from the group of Ema A, EmaB, EmaC, EmaD and EmaE, and 
more particularly, the DNA sequences or fragments thereof determined from the 
sequences set forth above. 

In a particular embodiment, the nucleic acid encoding the EmaA polypeptide has the 
15 sequence selected from the group comprising SEQ ID NO: 1; a sequence that 

hybridizes to SEQ ID NO.l under moderatestringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO :l or 
a sequence that hybridizes to SEQ ID NO:l under moderatestringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
20 thereof. In a particular embodiment, the nucleic acid encoding the EmaA polypeptide 
has the sequence selected from the group comprising SEQ ID NO:l; a sequence 
complementary to SEQ ID NO: 1; or a homologous sequence which is substantially 
similar to SEQ ID NO: 1. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:l. 

In a particular embodiment, the nucleic acid encoding the EmaB polypeptide has the 
sequence selected from the group comprising SEQ ED NO:3; a sequence that 
hybridizes to SEQ ID NO:3 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 3 or 
30 a sequence that hybridizes to SEQ ID NO:3 under moderate stringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof. In a particular embodiment, the nucleic acid encoding the EmaB polypeptide 
has the sequence selected from the group comprising SEQ ID NO:3; a sequence 
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complementary to SEQ ID NO:3; or a homologous sequence which is substantially 
similar to SEQ ID NO:3. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:3. 

5 In a particular embodiment, the nucleic acid encoding the EmaC polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 5; a sequence that 
hybridizes to SEQ ID NO: 5 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO:5 or 
a sequence that hybridizes to SEQ ID NO: 5 under moderate stringency hybridization 

10 conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaC polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 5; a sequence 
complementary to SEQ ED NO: 5; or a homologous sequence which is substantially 
similar to SEQ ID NO:5. In a further embodiment, the nucleic acid has the sequence 

15 consisting of SEQ ID NO:5. 

In a particular embodiment, the nucleic acid encoding the EmaD polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 7; a sequence that 
hybridizes to SEQ ID NO: 7 under moderate stringency hybridization conditions; DNA 

20 sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 7 or 
a sequence that hybridizes to SEQ ID NO: 7 under moderate stringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof. In a particular embodiment, the nucleic acid encoding the EmaD polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 7; a sequence 

25 complementary to SEQ ID NO:7; or a homologous sequence which is substantially 
similar to SEQ ID NO:7. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:7. 

In a particular embodiment, the nucleic acid encoding the EmaE polypeptide has the 
30 sequence selected from the group comprising SEQ ID NO: 9; a sequence that 

hybridizes to SEQ ID NO: 9 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ, ID NO: 9 or 
a sequence that hybridizes to SEQ ED NO: 9 under moderate stringency hybridization 
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conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof. In a particular embodiment, the nucleic acid encoding the EmaE polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 9; a sequence 
complementary to SEQ ID NO: 9; or a homologous sequence which is substantially 
5 similar to SEQ ID NO:9 In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:9. 

A nucleic acid capable of encoding a GBS polypeptide EmaA, EmaB, EmaC, EmaD or 
EmaE which is a recombinant DNA molecule is further provided. Such a recombinant 
10 DNA molecule wherein the DNA molecule is operatively linked to an expression 
control sequence is also provided herein. 

The present invention relates to nucleic acid vaccines or DNA vaccines comprising 
nucleic acids encoding immunogenic bacterial Ema polypeptides, particularly 

15 immunogenic streptococcal Ema polypeptides. The present invention relates to nucleic 
acid vaccines or DNA vaccines comprising nucleic acids encoding immunogenic GBS 
Ema polypeptides, particularly selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE. The present invention relates to nucleic acid vaccines or DNA 
vaccines comprising nucleic acids encoding one or more immunogenic GBS Ema 

20 polypeptide or a fragment thereof or any combination of one or more Ema polypeptide 
EmaA, EmaB, EmaC, EmaD or EmaE with at least one other GBS polypeptide, 
particularly wherein said other GBS polypeptide is selected from the group of Spbl, 
Spb2, C protein alpha antigen, Rib and immunogenic polypeptide fragments thereof. 

25 The invention further relates to a vaccine for protection of an animal subject from 
infection with a streptococcal bacterium comprising a vector containing a gene 
encoding an Ema polypeptide, particularly selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE, operatively associated with a promoter capable of directing 
expression of the gene in the subject. The invention further relates to a vaccine for 

30 protection of an animal subject from infection with a Group B streptococcal bacterium 
comprising a vector containing a gene encoding an Ema polypeptide selected from the 
group of EmaA, EmaB, EmaC, EmaD and EmaE operatively associated with a 
promoter capable of directing expression of the gene in the subject. The present 
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invention further provides a nucleic acid vaccine comprising a recombinant DNA 
molecule capable of encoding a GBS polypeptide EmaA EmaB, EmaC, EmaD 
EmaE. 



or 



5 The present invention provides a vector which comprises the nucleic acid capable of 
encoding encoding a bacterial Ema polypeptide, particularly a streptococcal Ema 
polypeptide. The present invention provides a vector which comprises the nucleic acid 
capable of encoding encoding an Ema polypeptide selected from the group of EmaA, 
EmaB, EmaC, EmaD and EmaE and a promoter. The present invention provides a 
10 vector which comprises the nucleic acid of any of SEQ ID NO: 1, 3, 5, 7, 9, 24, 27, 
30, and 33, and a promoter. The invention contemplates a vector wherein the 
promoter comprises a bacterial, yeast, insect or mammalian promoter. The invention 
contemplates a vector wherein the vector is a plasmid, cosmid, yeast artificial 
chromosome (YAC), bacteriophage or eukaryotic viral DNA. 

15 

The present invention further provides a host vector system for the production of a 
polypeptide which comprises the vector capable of encoding encoding an Ema 
polypeptide, particularly selected from the group of EmaA EmaB, EmaC, EmaD and 
EmaE, in a suitable host cell. A host vector system is provided wherein the suitable 
20 host cell comprises a prokaryotic or eukaryotic cell. A unicellular host transformed 
with a recombinant DNA molecule or vector capable of encoding encoding an Ema 
polypeptide, particularly selected from the group of EmaA EmaB, EmaC, EmaD and 
EmaE, is thereby provided. 

25 A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA 
segment may be attached so as to bring about the replication of the attached segment. 

A "DNA" or "DNA molecule" refers to the polymeric form of deoxyribonucleotides 
(adenine, guanine, thymine, or cytosine) in its either single stranded form, or a double- 
30 stranded helix. This term refers only to the primary and secondary structure of the 
molecule, and does not limit it to any particular tertiary forms. Thus, this term 
« includes double-stranded DNA found, inter alia, in linear DNA molecules, (e.g., 

restriction fragments), viruses, plasmids, and chromosomes. In discussing the 
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structure of particular double-stranded DNA molecules, sequences may be described 
herein according to the normal convention of giving only the sequence in the 5' to 3' 
direction along the nontranscribed strand of DNA (i.e., the strand having a sequence 
homologous to the mRNA). 

5 

An "origin of replication" refers to those DNA sequences that participate in DNA 
synthesis. 

A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed 

10 and translated into a polypeptide in vivo when placed under the control of appropriate 

* ■ « 

regulatory sequences. The boundaries of the coding sequence are determined by a 
start codon at the 5' (amino) terminus and a translation stop codon at the 3 ! (carboxyl) 
terminus. A coding sequence can include, but is not limited to, prokaryotic sequences, 
cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., 
15 mammalian) DNA, and even synthetic DNA sequences. A polyadenylation signal and 
transcription termination sequence will usually be located 3' to the coding sequence in 
the case of eukaryotic mRNA. 

Transcriptional and translational control sequences are DNA regulatory sequences, 
20 such as promoters, enhancers, polyadenylation signals, terminators, and the like, that 
provide for the expression of a coding sequence in a host cell. 

A "promoter sequence" is a DNA regulator/ region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3* direction) coding 

25 sequence. For purposes of defining the present invention, the promoter sequence is 
bounded at its 3' terminus by the transcription initiation site and extends upstream (5' 
direction) to include the minimum number of bases or elements necessary to initiate 
transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site (conveniently defined by mapping with 

30 nuclease SI), as well as protein binding domains (consensus sequences) responsible for 
the binding of RNA polymerase. Eukaryotic promoters will often, but not always, 
contain "TATA" boxes and "CAT" boxes. Prokaryotic promoters contaiij Shine- 
Dalgarno sequences in addition to the -10 and -35 consensus sequences. 
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An "expression control sequence" is a DNA sequence that controls and regulates the 
transcription and translation of another DNA sequence. A coding sequence is "under 
the control" of transcriptional and translational control sequences in a cell when RNA 
polymerase transcribes the coding sequence into mRNA, which is then translated into 
5 the protein encoded by the coding sequence. 

A "signal sequence" can be included before the coding sequence. This sequence 
encodes a signal peptide, N-terminal to the polypeptide, that communicates to the host 
cell to direct the polypeptide to the cell surface or secrete the polypeptide into the 
10 media, and this signal peptide is clipped off by the host cell before the protein leaves 
the cell. Signal sequences can be found associated with a variety of proteins native to 
prokaryotes and eukaryotes. 

The term "oligonucleotide," as used herein in referring to the probe of the present 
15 invention, is defined as a molecule comprised of two or more ribonucleotides, 

preferably more than three. Its exact size will depend upon many factors which, in 
turn, depend upon the ultimate function and use of the oligonucleotide. 

The term "primer" as used herein refers to an oligonucleotide, whether occurring 
20 naturally as in a purified restriction digest or produced synthetically, which is capable 
of acting as a point of initiation of synthesis when placed under conditions in which 
synthesis of a primer extension product, which is complementary to a nucleic acid 
strand, is induced, i.e., in the presence of nucleotides and an inducing agent such as a 
DNA polymerase and at a suitable temperature and pH. The primer may be either 
25 single-stranded or double-stranded and must be sufficiently long to prime the synthesis 
of the desired extension product in the presence of the inducing agent. The exact 
length of the primer will depend upon many factors, including temperature, source of 
primer and use of the method. For example, for diagnostic applications, depending on 
the complexity of the target sequence, the oligonucleotide primer typically contains 
30 15-25 or more nucleotides, although it may contain fewer nucleotides. 

The primers herein are selected to be "substantially" complementary to different 
strands of a particular target DNA sequence. This means that the primers must be 
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sufficiently complementary to hybridize with their respective strands. Therefore, the 
primer sequence need not reflect the exact sequence of the template. For example, a 
non-complementary nucleotide fragment may be attached to the 5* end of the primer, 
with the remainder of the primer sequence being complementary to the strand. 
5 Alternatively, non-complementary bases or longer sequences can be interspersed into 
the primer, provided that the primer sequence has sufficient complementarity with the 
sequence of the strand to hybridize therewith and thereby form the template for the 
synthesis of the extension product. 

10 As used herein, the terms "restriction endonucleases" and "restriction enzymes" refer 
to bacterial enzymes, each of which cut double-stranded DNA at or near a specific 
nucleotide sequence. 

A cell has been "transformed" by exogenous or heterologous DNA when such DNA 
15 has been introduced inside the cell. The transforming DNA may or may not be 

integrated (covalently linked) into chromosomal DNA making up the genome of the 
cell. In prokaryotes, yeast, and mammalian cells for example, the transforming DNA 
may be maintained on an episomal element such as a plasmid. With respect to 
eukaryotic cells, a stably transformed cell is one in which the transforming DNA has 
20 become integrated into a chromosome so that it is inherited by daughter cells through 
chromosome replication. This stability is demonstrated by the ability of the eukaryotic 
cell to establish cell lines or clones comprised of a population of daughter cells 
containing the transforming DNA. A "clone" is a population of cells derived from a 
single cell or common ancestor by mitosis. A "cell line" is a clone of a primary cell 
25 that is capable of stable growth in vitro for many generations. 

Two DNA sequences are "substantially homologous" when at least about 75% 
(preferably at least about 80%, and most preferably at least about 90 or 95%) of the 
nucleotides match over the defined length of the DNA sequences. Sequences that are 
30 substantially homologous can be identified by comparing the sequences using standard 
software available in sequence data banks, or in a Southern hybridization experiment 
under, for example, stringent conditions as defined for that particular system. Defining 
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appropriate hybridization conditions is within the skill of the art. See, e.g., Maniatis et 
al, supra; DNA Cloning, Vols. I & II, supra; Nucleic Acid Hybridization, supra. 



A DNA sequence is "operatively linked" to an expression control sequence when the 

5 expression control sequence controls and regulates the transcription and translation of 

that DNA sequence. The term "operatively linked" includes having an appropriate 

start signal (e.g., ATG) in front of the DNA sequence to be expressed and maintaining 

the correct reading frame to permit expression of the DNA sequence under the control 

of the expression control sequence and production of the desired product encoded by 

10 the DNA sequence. If a gene that one desires to insert into a recombinant DNA 

« 

molecule does not contain an appropriate start signal, such a start signal can be 
inserted in front of the gene. 

The term "standard hybridization conditions" refers to salt and temperature conditions 
15 substantially equivalent to 5 x SSC and 65°C for both hybridization and wash. 
However, one skilled in the art will appreciate that such "standard hybridization 
conditions" are dependent on particular conditions including the concentration of 
sodium and magnesium in the buffer, nucleotide sequence length and concentration, 
percent mismatch, percent formamide, and the like. Also important in the 
20 determination of "standard hybridization conditions" is whether the two sequences 
hybridizing are RNA-RNA, DNA-DNA or RNA-DNA. Such standard hybridization 
conditions are easily determined by one skilled in the art according to well known 
formulae, wherein hybridization is typically 10-20°C below the predicted or 
determined T m with washes of higher stringency, if desired. 

25 

It should be appreciated that also within the scope of the present invention are DNA 
sequences encoding an Ema polypeptide EmaA, EmaB, EmaC, EmaD or EmaE which 
code for an Ema polypeptide having the same amino acid sequence as any of SEQ ID 
NOS:2, 4, 6, 8 or 10, but which are degenerate to any of SEQ ID NOS: 1, 3, 5, 7 or 9. 
30 By "degenerate to" is meant that a different three-letter codon is used to specify a 
particular amino acid. It is well known in the art that the following codons can be 
used interchangeably to code for each specific amino acid: 
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Phenylalanine (Phe or F) UUU or UUC 

Leucine (Leu or L) UUA or UUG or CUU or CUC or CUA or CUG 

Isoleucine (lie or I) AUU or AUC or AUA 

Methionine (Met or M) AUG 

5 Valine ( Val or V) GUU or GUC of GUA or GUG 

Serine (Ser or S) UCU or UCC or UCA or UCG or AGU or AGC 

Proline (Pro or P) CCU or CCC or CCA or CCG 

Threonine (Thr or T) ACU or ACC or ACA or ACG 

Alanine (Ala or A) GCU or GCG or GCA or GCG 

1 0 Tyrosine (Tyr or Y) UAU or UAC 

Histidine (His or H) CAU or CAC 

Glutamine (Gin or Q) CAA or CAG 

Asparagine (Asn or N) AAU or AAC 

Lysine (Lys or K) AAA or AAG 

15 Aspartic Acid (Asp or D) GAU or GAC 

Glutamic Acid (Glu or E) GAA or GAG 

Cysteine (Cys or C) UGU or UGC 

Arginine (Arg or R) CGU or CGC or CGA or CGG or AGA or AGG 

Glycine (Gly or G) GGU or GGC or GGA or GGG 

20 Tryptophan (Trp or W) UGG 

Termination codon UAA (ochre) or UAG (amber) or UGA (opal) 

It should be understood that the codons specified above are for RNA sequences. The 
corresponding codons for DNA have a T substituted for U. 

25 

Mutations can be made in SEQ ID NOS: 1, 3, 5, 7 or 9 such that a particular codon is 
changed to a codon which codes for a different amino acid. Such a mutation is 
generally made by making the fewest nucleotide changes possible. A substitution 
mutation of this sort can be made to change an amino acid in the resulting protein in a 
30 non-conservative manner (i.e., by changing the codon from an amino acid belonging to 
a grouping of amino acids having a particular size or characteristic to an amino acid 
belonging to another grouping) or in a conservative manner (i.e., by changing the 
codon from an amino acid belonging to a grouping of amino acids having a particular 
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size or characteristic to an amino acid belonging to the same grouping). Such a 
conservative change generally leads to less change in the structure and function of the 
resulting protein. A non-conservative change is more likely to alter the structure, 
activity or function of the resulting protein. The present invention should be 
5 considered to include sequences containing conservative changes which do not 
significantly alter the activity or binding characteristics of the resulting protein. 

Two amino acid sequences are "substantially homologous" when at least about 70% of 
the amino acid residues (preferably at least about 80%, and most preferably at least 
10 about 90 or 95%) are identical, or represent conservative substitutions. 

A "heterologous" region of the DNA construct is an identifiable segment of DNA 
within a larger DNA molecule that is not found in association with the larger molecule 
in nature. Thus, when the heterologous region encodes a mammalian gene, the gene 

15 will usually be flanked by DNA that does not flank the mammalian genomic DNA in 
the genome of the source organism. Another example of a heterologous coding 
sequence is a construct where the coding sequence itself is not found in nature (e.g., a 
cDNA where the genomic coding sequence contains introns, or synthetic sequences 
having codons different than the native gene). Allelic variations or naturally-occurring 

20 mutational events do not give rise to a heterologous region of DNA as defined herein. 

A DNA sequence is "operatively linked" to an expression control sequence when the 
expression control sequence controls and regulates the transcription and translation of 
that DNA sequence. The term "operatively linked" includes having an appropriate 

25 start signal (e.g., ATG) in front of the DNA sequence to be expressed and maintaining 
the correct reading frame to permit expression of the DNA sequence under the control 
of the expression control sequence and production of the desired product encoded by 
the DNA sequence. If a gene that one desires to insert into a recombinant DNA 
molecule does not contain an appropriate start signal, such a start signal can be 

30 inserted in front of the gene. 



Further this invention also provides a vector which comprises the above-dpscribed 
nucleic acid molecule. The promoter may be, or is identical to, a bacterial, yeast, 
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insect or mammalian promoter. Further, the vector may be a plasmid, cosmid, yeast 
artificial chromosome (YAC), bacteriophage or eukaryotic viral DNA. Other 
numerous vector backbones known in the art as useful for expressing protein may be 
employed. Such vectors include, but are not limited to: adenovirus, simian virus 40 
5 (S V40), cytomegalovirus (CMV), mouse mammary tumor virus (MMTV), Moloney 
murine leukemia virus, DNA delivery systems, i.e. liposomes, and expression plasmid 
delivery systems. Such vectors may be obtained commercially or assembled from the 
sequences described by methods well-known in the art. 

10 This invention also provides a host vector system for the production of a polypeptide 
which comprises the vector of a suitable host cell. A wide variety of unicellular host 
cells are also useful in expressing the DNA sequences of this invention. These hosts 
may include well known eukaryotic and prokaryotic hosts, such as strains of E. coli, 
Pseudomonas, Bacillus, Streptomyces, fungi such as yeasts, and animal cells, such as 

15 CHO, Rl.l, B-W and L-M cells, African Green Monkey kidney cells (e.g., COS 1, 
COS 7, BSC1, BSC40, and BMT10), insect cells (e.g., Sf9), and human cells and 
plant cells in tissue culture. 

A wide variety of host/expression vector combinations may be employed in expressing 
20 the DNA sequences of this invention. Useful expression vectors, for example, may 
consist of segments of chromosomal, non-chromosomal and synthetic DNA 
sequences. Suitable vectors include derivatives of S V40 and known bacterial 
plasmids, e.g., E. coli plasmids col El, pCRl, pBR322, pMB9 and their derivatives, 
plasmids such as RP4; phage DNAs, e.g., the numerous derivatives of phage A, Ml 3 
25 and filamentous single stranded phage DNA; yeast plasmids such as the 2\y plasmid or 
derivatives thereof; vectors useful in eukaryotic cells, such as vectors useful in insect 
or mammalian cells; vectors derived from combinations of plasmids and phage DNAs, 
such as plasmids that have been modified to employ phage DNA or other expression 
control sequences; and the like. 

30 

Any of a wide variety of expression control sequences — sequences that control the 
expression of a DNA sequence operatively linked to it ~ may be used in these vectors 
to express the DNA sequences of this invention. Such useful expression control 
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sequences include, for example, the early or late promoters of SV40, CMV, vaccinia, 
polyoma or adenovirus, the lac system, the trp system, the TAC system, the TRC 
system, the LTR system, the major operator and promoter regions of phage A, the 
control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase or 
5 other glycolytic enzymes, the promoters of acid phosphatase (e.g., Pho5), the 

promoters of the yeast a-mating factors, and other sequences known to control the 
expression of genes of prokaryotic or eukaryotic cells or their viruses, and various 
combinations thereof. 

10 It will be understood that not all vectors, expression control sequences and hosts will 
function equally well to express the DNA sequences of this invention. Neither will all 
hosts function equally well with the same expression system. However, one skilled In 
the art will be able to select the proper vectors, expression control sequences, and 
hosts without undue experimentation to accomplish the desired expression without 

15 departing from the scope of this invention. For example, in selecting a vector, the host 
must be considered because the vector must Sanction in it. The vector's copy number, 
the ability to control that copy number, and the expression of any other proteins 
encoded by the vector, such as antibiotic markers, will also be considered. 

20 In selecting an expression control sequence, a variety of factors will normally be 
considered. These include, for example, the relative strength of the system, its 
controllability, and its compatibility with the particular DNA sequence or gene to be 
expressed, particularly as regards potential secondary structures. Suitable unicellular 
hosts will be selected by consideration of, e.g., their compatibility with the chosen 

25 vector, their secretion characteristics, their ability to fold proteins correctly, and their 
fermentation requirements, as well as the toxicity to the host of the product encoded 
by the DNA sequences to be expressed, and the ease of purification of the expression 
products. 

30 This invention further provides a method of producing a polypeptide which comprises 
growing the above-described host vector system under suitable conditions permitting 
the production of the polypeptide and recovering the polypeptide so produced. 
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As used herein, "pg" means picogram, "ng M means nanogram, "ug" or Vg" mean 
microgram, "mg" means milligram, "uI" or >1" mean microliter, "ml" means milliliter, 
"1" means liter. 

5 The present invention extends to the preparation of antisense oligonucleotides and 
ribozymes that may be used to interfere with the expression of one or more Ema 
protein at the translational level. This approach utilizes antisense nucleic acid and 
ribozymes to block translation of a specific mRNA, either by masking that mRNA with 
an antisense nucleic acid or cleaving it with a ribozyme. 

10 

Antisense nucleic acids are DNA or RNA molecules that are complementary to at least 
a portion of a specific mRNA molecule. (See Weintraub, 1990; Marcus-Sekura, 1988.) 
In the cell, they hybridize to that mRNA, forming a double stranded molecule. The 
cell does not translate an mRNA in this double-stranded form. Therefore, antisense 

15 nucleic acids interfere with the expression of mRNA into protein. Oligomers of about 
fifteen nucleotides and molecules that hybridize to the AUG initiation codon will be 
particularly efficient, since they are easy to synthesize and are likely to pose fewer 
problems than larger molecules when introducing them into Ema-producing cells. 
Antisense methods have been used to inhibit the expression of many genes in vitro 

20 (Marcus-Sekura, 1988; Hambor et al, 1988). 

Ribozymes are RNA molecules possessing the ability to specifically cleave other single 
stranded RNA molecules in a manner somewhat analogous to DNA restriction 
endonucleases. Ribozymes were discovered from the observation that certain mRNAs 
25 have the ability to excise their own introns. By modifying the nucleotide sequence of 
these RNAs, researchers have been able to engineer molecules that recognize specific 
nucleotide sequences in an RNA molecule and cleave it (Cech, 1988.). Because they 
are sequence-specific, only mRNAs with particular sequences are inactivated. 

30 Investigators have identified two types of ribozymes, Tetrahymena-type and 

M hammerhead"-type. (Hasselhoff and Gerlach, 1988) Tetrahymena-type ribozymes 
recognize four-base sequences, while n hammerhead M -type recognize eleven- to 
eighteen-base sequences. The longer the recognition sequence, the more likely it is to 
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occur exclusively in the target mRNA species. Therefore, hammerhead-type 
ribozymes are preferable to Tetrahymena-type ribozymes for inactivating a specific 
mRNA species, and eighteen base recognition sequences are preferable to shorter 
recognition sequences. 

5 

Antibodies 

This invention further provides an antibody capable of specifically recognizing or 
binding to the isolated Ema polypeptide of the present invention. The antibody may be 
10 a monoclonal or polyclonal antibody. Further, the antibody may be labeled with a 
detectable marker that is either a radioactive, calorimetric, fluorescent, or a 
luminescent marker. The labeled antibody may be a polyclonal or monoclonal 
antibody. In one embodiment, the labeled antibody is a purified labeled antibody. 
Methods of labeling antibodies are well known in the art. 

15 

In a further aspect, the present invention provides a purified antibody to a bacterial 
Ema polypeptide, particularly a streptococcal Ema polypeptide. In a still further 
aspect, the present invention provides a purified antibody to a Group B sreptococcal 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. 

20 

Antibodies against the isolated polypeptides of the present invention include naturally 
raised and recombinantly prepared antibodies. These may include both polyclonal and 
monoclonal antibodies prepared by known genetic techniques, as well as bi-specific 
(chimeric) antibodies, and antibodies including other functionalities suiting them for 
25 diagnostic use. Such antibodies can be used in immunoassays to diagnose infection 
with a particular strain or species of bacteria. The antibodies can also be used for 
passive immunization to treat an infection with Group B streptococcal bacteria. These 
antibodies may also be suitable for modulating bacterial adherence and/or invasion 
including but not limited to acting as competitive agents. 

30 

The present invention provides a monoclonal antibody to a Group B streptococcal 
poypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. The 
invention thereby extends to an immortal cell line that produces a monoclonal antibody 
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to a Group B streptococcal poypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE. 

An antibody to an Ema polypeptide, particularly selected from EmaA, EmaB, EmaC. 
5 EmaD or EmaE, labeled with a detectable label is further provided. In particular 
embodiments, the label may selected from the group consisting of an enzyme, a 
chemical which fluoresces, and a radioactive element. 

The term "antibody" includes, by way of example, both naturally occurring and non- 
10 naturally occurring antibodies. Spe^fically, the term "antibody" includes polyclonal 
and monoclonal antibodies, and fragments thereof. Furthermore, the term "antibody" 
includes chimeric antibodies and wholly synthetic antibodies, and fragments thereof. 
Such antibodies include but are not limited to polyclonal, monoclonal, chimeric, single 
chain, Fab fragments, and an Fab expression library. 

15 

An "antibody" is any immunoglobulin, including antibodies and fragments thereof, that 
binds a specific epitope. The term encompasses polyclonal, monoclonal, and chimeric 
antibodies, the last mentioned described in further detail in U.S. Patent Nos. 4,8 1 6,397 
and 4,816,567. 

20 

An "antibody combining site" is that structural portion of an antibody molecule 
comprised of heavy and light chain variable and hypervariable regions that specifically 
binds antigen. 

25 The phrase "antibody molecule" in its various grammatical forms as used herein 

contemplates both an intact immunoglobulin molecule and an immunologically active 
portion of an immunoglobulin molecule. 

Exemplary antibody molecules are intact immunoglobulin molecules, substantially 
30 intact immunoglobulin molecules and those portions of an immunoglobulin molecule 
that contains the paratope, including those portions known in the art as Fab, Fab', 
F(ab') 2 and F(v), which portions are preferred for use in the therapeutic methods 
described herein. 
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Fab and F(ab , ) 2 portions of antibody molecules are prepared by the proteolytic reaction 
of papain and pepsin, respectively, on substantially intact antibody molecules by 
methods that are well-known. See for example, U.S. Patent No. 4,342,566 to 
Theofilopolous et al. Fab' antibody molecule portions are also well-known and are 
5 produced from F(ab') 2 portions followed by reduction of the disulfide bonds linking the 
two heavy chain portions as with mercaptoethanol, and followed by alkylation of the 
resulting protein mercaptan with a reagent such as iodoacetamide. An antibody 
containing intact antibody molecules is preferred herein. 

10 The phrase "monoclonal antibody" in its various grammatical forms refers to an 
antibody having only one species of antibody combining site capable of 
immunoreacting with a particular antigen. A monoclonal antibody thus typically 
displays a single binding affinity for any antigen with which it immunoreacts, A 
monoclonal antibody may therefore contain an antibody molecule having a plurality of 

15 antibody combining sites, each immunospecific for a different antigen; e.g., a bispecific 
(chimeric) monoclonal antibody. 

Various procedures known in the art may be used for the production of polyclonal 
antibodies to polypeptide or derivatives or analogs thereof {see, e.g., Antibodies — A 

20 Laboratory Manual, Harlow and Lane, eds., Cold Spring Harbor Laboratory Press: 
Cold Spring Harbor, New York, 1988). For the production of antibody, various host 
animals can be immunized by injection with the Group B streptococcal Ema 
polypeptide, an immunogenic fragment thereof, or a derivative (e.g., fragment or 
fusion protein) thereof, including but not limited to rabbits, mice, rats, sheep, goats, 

25 etc. In one embodiment, the polypeptide can be conjugated to an immunogenic 
carrier, e.g., bovine serum albumin (BSA) or keyhole limpet hemocyanin (KLH). 
Various adjuvant may be used to increase the immunological response, depending on 
the host species. 

30 For preparation of monoclonal antibodies, or fragment, analog, or derivative thereof, 
any technique that provides for the production of antibody molecules by continuous 
cell lines in culture may be used (see, e.g., Antibodies — A Laboratory Manual, 
Harlow and Lane, eds., Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 
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New York, 1988). These include but are not limited to the hybridoma technique 
originally developed by Kohler and Milstein (1975, Nature 256:495-497), as well as 
the trioma technique, the human B-cell hybridoma technique (Kozbor et al., 1983, 
Immunology Today 4:72), and the EBV-hybridoma technique to produce human 
5 monoclonal antibodies (Cole et al., 1985, in Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). Monoclonal antibodies can be produced in 
germ-free animals utilizing recent technology (PCT/US90/02545). Human antibodies 
may be used and can be obtained by using human hybridomas (Cote et al, 1983, Proc. 
Natl Acad. Sci. U.S.A. 80:2026-2030) or by transforming human B cells with EBV 

10 virus in vitro (Cole et al., 1985, in Afonoclonal Antibodies and Cancer Therapy, Alan 
R. Liss, pp. 77-96). In fact, according to the invention, techniques developed for the 
production of "chimeric antibodies" (Morrison et al , 1984, J. Bacteriol 159-870; 
Neuberger etal, 1984, Nature 312:604-608; Takedae/ al, 1985, Nature 3 14:452- 
454) by splicing the genes from a mouse antibody molecule specific for a polypeptide 

15 together with genes from a human antibody molecule of appropriate biological activity 
can be used; such antibodies are within the scope of this invention. Such human or 
humanized chimeric antibodies are preferred for use in therapy of human infections or 
diseases, since the human or humanized antibodies are much less likely than xenogenic 
antibodies to induce an immune response, in particular an allergic response, 

20 themselves. An additional embodiment of the invention utilizes the techniques 

described for the construction of Fab expression libraries (Huse et al, 1989, Science 
246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments 
with the desired specificity for the polypeptide, or its derivatives, or analogs. 

25 Antibody fragments which contain the idiotype of the antibody molecule can be 
generated by known techniques. For example, such fragments include but are not 
limited to: the F(ab f ) 2 fragment which can be produced by pepsin digestion of the 
antibody molecule; the Fab 1 fragments which can be generated by reducing the 
disulfide bridges of the F(ab') 2 fragment, and the Fab fragments which can be 

30 generated by treating the antibody molecule with papain and a reducing agent. 

In the production of antibodies, screening for the desired antibody can be * 
accomplished by techniques known in the art, e.g. , radioimmunoassay, ELIS A 
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(enzyme-linked immunosorbant assay), "sandwich" immunoassays, immunoradiometric 
assays, gel diffusion precipitin reactions, immunodiffusion assays, in situ 
immunoassays (using colloidal gold, enzyme or radioisotope labels, for example), 
western blots, precipitation reactions, agglutination assays (e.g., gel agglutination 
5 assays, hemagglutination assays), complement fixation assays, immunofluorescence 
assays, protein A assays, and Immunoelectrophoresis assays, etc. In one embodiment, 
antibody binding is detected by detecting a label on the primary antibody. In another 
embodiment, the primary antibody is detected by detecting binding of a secondary 
antibody or reagent to the primary antibody. In a further embodiment, the secondary 
10 antibody is labeled. Many means are known in the art for detecting binding in an 
immunoassay and are within the scope of the present invention. 

Antibodies can be labeled for detection in vitro, e.g., with labels such as enzymes, 
fluorophores, chromophores, radioisotopes, dyes, colloidal gold, latex particles, and 
15 chemiluminescent agents. Alternatively, the antibodies can be labeled for detection in 
vivo, e.g., with radioisotopes (preferably technetium or iodine); magnetic resonance 
shift reagents (such as gadolinium and manganese); or radio-opaque reagents. 

The labels most commonly employed for these studies are radioactive elements, 
20 enzymes, chemicals which fluoresce when exposed to ultraviolet light, and others. A 
number of fluorescent materials are known and can be utilized as labels. These 
include, for example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue and 
Lucifer Yellow. A particular detecting material is anti-rabbit antibody prepared in 
goats and conjugated with fluorescein through an isothiocyanate. The polypeptide can 
25 also be labeled with a radioactive element or with an enzyme. The radioactive label 
can be detected by any of the currently available counting procedures. The preferred 
isotope may be selected from 3 H, 14 C, 32 P, 35 S, 36 C1, 51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, 125 I, I31 I, 
and 186 Re. 

30 Enzyme labels are likewise useful, and can be detected by any of the presently utilized 
calorimetric, spectrophotometric, fluorospectrophotometric, amperometric or 
gasometric techniques. The enzyme is conjugated to the selected particle*by reaction 
with bridging molecules such as carbodiimides, diisocyanates, glutaraldehyde and the 
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like. Many enzymes which can be used in these procedures are known and can be 
utilized. The preferred are peroxidase, B-glucuronidase, B-D-glucosidase, 
B-D-galactosidase, urease, glucose oxidase plus peroxidase and alkaline phosphatase. 
U.S. Patent Nos. 3,654,090; 3,850,752; and 4,016,043 are referred to by way of 
5 example for their disclosure of alternate labeling material and methods. 

Diagnostic Applications 



10 The present invention also relates to a variety of diagnostic applications, including 

methods for identifying or monitoring streptococcal infections. The present invention 
also relates to a variety of diagnostic applications, including methods for identifying or 
monitoring Group B streptococcal infections. The present invention further relates to 
diagnostic applications or methods utilizing the polypeptides of the present invention, 

15 immunogenically recognized fragments thereof, or antibodies thereto. Such methods 
include the analysis and evaluation of agents, analogs or compounds which modulate 
the activity of the Ema polypeptides. The Ema polypeptides may also be utilized in 
diagnostic methods and assays for monitoring and determining immunological 
response and antibody response upon streptococcal infection or vaccination. 

20 

As described in detail above, antibody(ies) to the Ema polypeptides or fragments 
thereof can be produced and isolated by standard methods including the well known 
hybridoma techniques. For convenience, the antibody(ies) to the Ema polypeptides 
will be referred to herein as A^ and antibody(ies) raised in another species as Ab 2 . 

25 

The presence of streptococci in cells can be ascertained by the usual immunological 
procedures applicable to such determinations. A number of useful procedures are 
known. Procedures which are especially useful utilize either the Ema polypeptides 
labeled with a detectable label, antibody against the Ema polypeptides labeled with a 
30 detectable label, or secondary antibody labeled with a detectable label. 

The procedures and their application are all familiar to those skilled in the. art and 
accordingly may be utilized within the scope of the present invention. The 
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"competitive" procedure, is described in U.S. Patent Nos. 3,654,090 and 3,850,752. 
The "sandwich" procedure, is described in U.S. Patent Nos. RE 3 1,006 and 4,016,043. 
Still other procedures are known such as the "double antibody," or "DASP" 
procedure. 

5 

In each instance, the Ema polypeptides forms complexes with one or more 
antibody(ies) or binding partners and one member of the complex is labeled with a 
detectable label. The fact that a complex has formed and, if desired, the amount 
thereof, can be determined by known methods applicable to the detection of labels. 

10 

In a further embodiment of this invention, commercial test kits suitable for use by a 
medical specialist may be prepared to determine the presence or absence of 
stretococci, particularly of streptococci expressing one or more Ema polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. In as much as the 

15 ema locus, as described herein, is found in the genomic DNA of many, if not all, 
serotypes of Group B streptococci, it is a useful general marker for Group B 
streptococci. In as much as Ema homologs exist in other species of streptococci, 
including Group A and S. pneumoniae, it is a useful general marker for streptococci. 
Therefore, commercial test kits for determining the presence or absence of 

20 streptococci, and thereby determining whether an individual is infected with 

streptococci are contemplated and provided by this invention. Therefore, commercial 
test kits for determining the presence or absence of Group B streptococci, and thereby 
determining whether an individual is infected with Group B streptococci are 
contemplated and provided by this invention. 

25 

The present invention includes methods for determining and monitoring infection by 
streptococci by detecting the presence of a streptococcal polypeptide selected from the 
group of EmaA, EmaB, EmaC, EmaD and EmaE. In a particular such method, the 
streptococcal Ema polypeptide is measured by: 
30 a. contacting a sample in which the presence or activity of a Streptococcal 

polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD 
and EmaE is suspected with an antibody to the said streptococcal 
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polypeptide under conditions that allow binding of the streptococcal 
polypeptide to the antibody to occur; and 
b. detecting whether binding has occurred between the streptococcal 
polypeptide from the sample and the antibody; 
5 wherein the detection of binding indicates the presence or activity of the streptococcal 
polypeptide in the sample. 



The present invention includes methods for determining and monitoring infection by 
10 Group B streptococci by detecting the presence of a Group B streptococcal 

polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. In a 
particular such method, the streptococcal Ema polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a Group B 
Streptococcal polypeptide selected from the group of EmaA, EmaB, 
15 EmaC, EmaD and EmaE is suspected with an antibody to the said 

Group B streptococcal polypeptide under conditions that allow binding 
of the Group B streptococcal polypeptide to the antibody to occur; and 



b. detecting whether binding has occurred between the Group B 
20 streptococcal polypeptide from the sample and the antibody; 

wherein the detection of binding indicates the presence or activity of the a Group B 
streptococcal polypeptide in the sample. 

The present invention further provides a method for detecting the presence of a 
25 bacterium having a gene encoding a Group B polypeptide selected from the group of 
ema A, emaB, emaC, emaD and emaE, comprising: 

a. contacting a sample in which the presence or activity of the bacterium 
is suspected with an oligonucleotide which hybridizes to a Group B 
streptococcal polypeptide gene selected from the group of emaA, 

30 emaB, emaC, emaD and emaE 7 under conditions that allow specific 

hybridization of the oligonucleotide to the gene to occur; and 

b. detecting whether hybridization has occurred between the * 
oligonucleotide and the gene; 
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wherein the detection of hybridization indicates that presence or activity of the 
bacterium in the sample. 

The invention includes an assay system for screening of potential compounds effective 
5 to modulate the activity of a bacterial Ema protein of the present invention. In one 
instance, the test compound, or an extract containing the compound, could be 
administered to a cellular sample expressing the particular Ema protein to determine 
the compound's effect upon the activity of the protein by comparison with a control. 
In a further instance the test compound, or an extract containing the compound, could 
10 be administered to a cellular sample expressing the Ema protein to determine the 
compound's effect upon the activity of the protein, and thereby on adherence of said 
cellular sample to host cells, by comparison with a control. 

Accordingly, a test kit may be prepared for the demonstration of the presence of Ema 
15 polypeptide or Ema activity in cells, comprising: 

(a) a predetermined amount of at least one labeled immunochemically reactive 
component obtained by the direct or indirect attachment of the Ema polypeptide or a 
specific binding partner thereto, to a detectable label; 

(b) other reagents; and 

20 (c) directions for use of said kit. 

More specifically, the diagnostic test kit may comprise: 

(a) a known amount of the Ema polypeptide as described above (or a binding 
partner) generally bound to a solid phase to form an immunosorbent, or in the 

25 alternative, bound to a suitable tag, or plural such end products, etc. (or their binding 
partners) one of each; 

(b) if necessary, other reagents; and 

(c) directions for use of said test kit. 

30 In a further variation, the test kit may be prepared and used for the purposes stated 
above, which operates according to a predetermined protocol (e.g. "competitive," 
"sandwich," "double antibody," etc.), and comprises: 
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(a) a labeled component which has been obtained by coupling the Ema polypeptide 
to a detectable label; 

(b) one or more additional immunochemical reagents of which at least one reagent 
is a ligand or an immobilized ligand, which ligand is selected from the group consisting 

5 of: 

(i) a ligand capable of binding with the labeled component (a); 

(ii) a ligand capable of binding with a binding partner of the labeled 
component (a); 

(iii) a ligand capable of binding with at least one of the component(s) to be 
10 determined; and 

(iv) a ligand capable of binding with at least one of the binding partners of 
at least one of the component(s) to be determined; and 

(c) directions for the performance of a protocol for the detection and/or 
determination of one or more components of an immunochemical reaction between the 

15 Ema polypeptide and a specific binding partner thereto. 

In accordance with the above, an assay system for screening potential drugs effective 
to modulate the activity of the Ema polypeptide may be prepared. The Ema 
polypeptide may be introduced into a test system, and the prospective drug may also 
20 be introduced into the resulting cell culture, and the culture thereafter examined to 
observe any changes in the Ema polypeptide activity of the cells, due either to the 
addition of the prospective drug alone, or due to the effect of added quantities of the 
known Ema polypeptide. 

25 Therapeutic Applications 

The therapeutic possibilities that are raised by the existence of the Group B 
streptococcal Ema polypeptides EmaA, EmaB, EmaC, EmaD and EmaE derive from 
the fact that the Ema polypeptides of the present invention are found generally in 
30 various serotypes of Group B streptococci. In addition, broader therapeutic 

possibilities that are raised by the existence of Ema homologous polypeptides in 
various distinct species of streptococci, including S. pneumoniae and S. pyogenes. In 
addition Ema homologous polypeptides have been identified in E. faecalis and C. 
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diptheriae. Of particular relevance to their suitability in vaccine and immunological 
therapy is that the Ema A, EmaB, and EmaC polypeptides possess N-terminal 
sequences consistent with a signal peptide, indicating secretion from the bacterial cell 
and at least partial extracellular localization. In addition, the EmaA, EmaB, EmaC, 
5 EmaD and EmaE polypeptides demonstrate homology to distinct bacterial proteins 
involved in or implicated in bacterial adhesion and invasion. Thus, the Ema 
polypeptides are anticipated to be involved in or required for streptococcal adhesion to 
and/or invasion of cells, critical for bacterial survival and virulence in the human host. 

1 0 Modulators of Extracellular Matrix Adhesin Protein 

Thus, in instances where it is desired to reduce or inhibit the effects resulting from the 
extracellular matrix adhesin protein Ema of the present invention, an appropriate 
inhibitor of one or more of the Ema proteins, particularly EmaA, EmaB, EmaC, EmaD 
15 and EmaE could be introduced to block the activity of one or more Ema protein. 

The present invention contemplates screens for a modulator of an Ema polypeptide, in 
particular modulating adhesion or invasion facilitated by EmaA, EmaB, EmaC, EmaD 
or EmaE. In one such embodiment, an expression vector containing the Ema 

20 polypeptide of the present invention, or a derivative or analog thereof, is placed into a 
cell in the presence of at least one agent suspected of exhibiting Ema polypeptide 
modulator activity. The cell is preferably a bacterial cell, most preferably a 
streptococcal cell, or a bacterial host cell. The amount of adhesion or binding activity 
is determined and any such agent is identified as a modulator when the amount of 

25 adhesion or binding activity in the presence of such agent is different than in its 

absence. The vectors may be introduced by any of the methods described above. In a 
related embodiment the GBS Ema polypeptide is expressed in streptococci and the 
step of determining the amount of adhesion or binding activity is performed by 
determining the amount of binding to bacterial host cells cells in vitro. 

30 

When the amount of adhesion or binding activity in the presence of the modulator is 
greater than in its absence, the modulator is identified as an agonist or activator of the 
Ema polypeptide, whereas when the amount of adhesion binding activity in the 
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presence of the modulator is less than in its absence, the modulator is identified as an 
antagonist or inhibitor of the Ema polypeptide. As any person having skill in the art 
would recognize, such determinations as these and those below could require some 
form of statistical analysis, which is well within the skill in the art. 

5 

Natural effectors found in cells expressing Ema polypeptide can be fractionated and 
tested using standard effector assays as exemplified herein, for example. Thus an 
agent that is identified can be a naturally occurring adhesion or binding modulator. 
Alternatively, natural products libraries can be screened using the assays of the present 
10 invention for screening such agents. % 

Another approach uses recombinant bacteriophage to produce large libraries. Using 
the "phage method" [Scott and Smith, 1990, Science 249:386-390 (1990); Cwirla, et 
al., Proc. Natl Acad Sci., 87:6378-6382 (1990); Devlin et al., Science, 249:404-406 

15 (1990)], very large libraries can be constructed (10 6 -10 8 chemical entities). Yet 
another approach uses primarily chemical methods, of which the Geysen method 
[Geysen et al., Molecular Immunology 23:709-715 (1986); Geysen et al. J. 
Immunologic Method 102:259-274 (1987)] and the method of Fodor et al. [Science 
251:767-773 (1991)] are examples. Furka et al. [14th International Congress of 

20 Biochemistry, Volume 5, Abstract FR:013 (1988); Furka, Int. J. Peptide Protein Res. 
37:487-493 (1991)], Houghton [U.S. Patent No. 4,631,211, issued December 1986] 
and Rutter et al. [U.S. Patent No. 5,010,175, issued April 23, 1991] describe methods 
to produce a mixture of peptides that can be tested. 

25 In another aspect, synthetic libraries [Needels et al., Proc. Natl Acad. Sci. USA 
90:10700-4 (1993); Ohlmeyer et al., Proc. Natl Acad. Sci. USA 90:10922-10926 
(1993); Lam et al, International Patent Publication No. WO 92/00252; Kocis et al, 
International Patent Publication No. WO 9428028, each of which is incorporated 
herein by reference in its entirety], and the like can be used to screen for such an agent. 

30 

This invention provides antagonist or blocking agents which include but are not limited 
to: peptide fragments, mimetic, a nucleic acid molecule, a ribozyme, a polypeptide, a 
small molecule, a carbohydrate molecule, a monosaccharide, an oligosaccharide or an 
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antibody. Also, agents which competitively block or inhibit streptococcal bacterium 
are contemplated by this invention. This invention provides an agent which comprises 
an inorganic compound, a nucleic acid molecule, an oligonucleotide, an organic 
compound, a peptide, a peptidomimetic compound, or a protein which inhibits the 
5 polypeptide. 

Vaccines 

10 In a further aspect, the present invention extends to vaccines based on the Ema 

proteins described herein. The present invention provides a vaccine comprising one or 
more Group B streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE, and a pharmaceutically acceptable adjuvant. The present 
invention provides a vaccine comprising one or more bacterial Ema polypeptide 

15 selected from the group of polypeptides comprising the amino acid sequence set out in 
any of SEQ ID NO: 23, 26, 29, 32 and 37, and a pharmaceutically acceptable 
adjuvant. 

The present invention further provides a vaccine comprising one or more Group B 
20 streptococcal polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, further comprising one or more additional GBS antigen. The present 
invention further provides a vaccine comprising one or more Group B streptococcal 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, 
further comprising one or more antigens selected from the group of the polypeptide 
25 Spbl or an immunogenic fragment thereof, the polypeptide Spb2 or an immunogenic 
fragment thereof, C protein alpha antigen or an immunogenic fragment thereof, Rib or 
an immunogenic fragment thereof, Lmb or an immunogenic fragment thereof, C5a-ase 
or an immunogenic fragment thereof, and Group B streptococcal polysaccharides or 
oligosaccharides. 

30 

In another aspect, the invention is directed to a vaccine for protection of an animal 
subject from infection with streptococci comprising an immunogenic amount of one or 
more streptococcal Ema polypeptide, or a derivative or fragment thereof. The Ema 
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polypeptide may be particularly selected from the group of EmaA, EmaB, EmaC, 
EmaD or EmaE, or a derivative or fragment thereof. In a further aspect, the invention 
is directed to a vaccine for protection of an animal subject from infection with 
streptococci comprising an immunogenic amount of one or more Ema polypeptide 
5 EmaA, EmaB, EmaC, EmaD or EmaE, or a derivative or fragment thereof. In a 

further aspect, the invention is directed to a vaccine for protection of an animal subject 
from infection with GBS comprising an immunogenic amount of one or more Ema 
polypeptide EmaA, EmaB, EmaC, EmaD or EmaE, or a derivative or fragment 
thereof Such a vaccine may contain the protein conjugated covalently to a 
10 streptococcal or GBS bacterial polysaccharide or oligosaccharide or polysaccharide or 
oligosaccharide from one or more streptococcal or GBS serotypes. 

This invention provides a vaccine which comprises a polypeptide bacterial Ema 
protein and a pharmaceutically acceptable adjuvant or carrier. In particular, a vaccine 
is provided which comprises one or more Ema polypeptides selected from the group of 
EmaA, EmaB, EmaC, EmaD and EmaE. This invention provides a vaccine which 
comprises a combination of at least one bacterial Ema protein selected from the group 
of EmaA, EmaB, EmaC, EmaD and EmaE and at least one other Group B 
streptococcal protein particularly Spbl and/or Spb2 and/or C protein alpha antigen, 
and a pharmaceutically acceptable adjuvant or carrier. The Ema polypeptide may 
comprise an amino acid sequence of a Ema protein EmaA, EmaB, EmaC, EmaD, 
EmaE as set forth in FIGURES 2-6 and SEQ ID NOS: 2, 4, 6, 8 and 10. 

This invention further provides a vaccine comprising an isolated nucleic acid encoding 
25 a bacterial Ema polypeptide and a pharmaceutically acceptable adjuvant or carrier. 
This invention further provides a vaccine comprising an isolated nucleic acid encoding 
a streptococcal Ema polypeptide and a pharmaceutically acceptable adjuvant or 
carrier. This invention further provides a vaccine comprising an isolated nucleic acid 
encoding a GBS Ema polypeptide and a pharmaceutically acceptable adjuvant or 
30 carrier. This invention further provides a vaccine comprising isolated nucleic acid 
encoding one or more GBS Ema polypeptide, particularly selected from the group of 
EmaA, EmaB, EmaC, EmaD and EmaE and a pharmaceutically acceptable adjuvant 
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or carrier. The nucleic acid may comprise a nucleic acid sequence of a GBS Ema 
polypeptide as set forth in any of SEQ ID NOS:l, 3, 5, 7, or 9. 

Active immunity against streptococci can be induced by immunization (vaccination) 
5 with an immunogenic amount of the polypeptide, or peptide derivative or fragment 
thereof, and an adjuvant, wherein the polypeptide, or antigenic derivative or fragment 
thereof, is the antigenic component of the vaccine. The polypeptide, or antigenic 
derivative or fragment thereof, may be one antigenic component, in the presence of 
other antigenic components in a vaccine. For instance, the polypeptide of the present 

10 invention may be combined with other known streptococcal polypeptides or 

pofy/oligo saccharides, or immunogenic fragments thereof, including for instance GBS 
capsular polysaccharide, Spbl, Spb2, C protein alpha antigen, Rib, Lmb, and C5a-ase 
in a multi-component vaccine. Such multi-component vaccine may be utilized to 
enhance immune response, even in cases where the polypeptide of the present 

15 invention elicits a response on its own. The polypeptide of the present invention may 
also be combined with existing vaccines, whole bacterial or capsule-based vaccines, 
alone or in combination with other GBS polypeptides, particularly Spbl and/or Spb2 * 
and/or C protein alpha antigen and/or Rib to enhance such existing vaccines. 

20 The term "adjuvant" refers to a compound or mixture that enhances the immune 

response to an antigen. An adjuvant can serve as a tissue depot that slowly releases 
the antigen and also as a lymphoid system activator that non-specifically enhances the 
immune response (Hood et al., Immunology, Second Ed, 1984, Benjamin/Cummings: 
Menlo Park, California, p. 384). Often, a primary challenge with an antigen alone, in 

25 the absence of an adjuvant, will fail to elicit a humoral or cellular immune response. 
Adjuvant include, but are not limited to, complete Freund's adjuvant, incomplete 
Freund ! s adjuvant, saponin, mineral gels such as aluminum hydroxide, surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil or 
hydrocarbon emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially 

30 useful human adjuvant such as BCG (bacille Calmette-Gueriri) and Coryne bacterium 
parvum. Preferably, the adjuvant is pharmaceutical^ acceptable. 
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The invention further provides a vaccine which comprises a non-adherent, non- virulent 
mutant, including but not limited to the ema mutants herein described and 
contemplated. Medaglini et al (Madaglini et al (1 995) Proc Natl Acad Sci USA 
92;6868-6872) and Oggioni and Pozzi (Oggioni, M.R. and Pozzi, G. (1996) Gene 
5 1 69: 85-90) have previously described the use of Streptococcus gordonii, a commensal 
bacterium of the human oral cavity, as live vaccine delivery vehicles and for 
heterologous gene expression. Such etna' mutant can therefore be utilized as a 
vehicle for expression of immunogenic proteins for the purposes of eliciting an 
immune response to such other proteins in the context of vaccines. Active immunity 
10 against Group B streptococci, can b,e induced by immunization (vaccination) with an 
immunogenic amount of the ema vehicle expressing an immunogenic protein. Also 
contemplated by the present invention is the use of any such ema' mutant in 
expressing a therapeutic protein in the host in the context of other forms of therapy. 

15 The polypeptide of the present invention, or fragments thereof, can be prepared in an 
admixture with an adjuvant to prepare a vaccine. Preferably, the polypeptide or 
peptide derivative or fragment thereof, used as the antigenic component of the 
vaccine is an antigen common to all or many serotypes of GBS bacteria, or common to 
closely related species of bacteria, for instance Streptococcus. 

20 

Vectors containing the nucleic acid-based vaccine of the invention can be introduced 
into the desired host by methods known in the art, e.g., transfection, electroporation, 
micro injection, transduction, cell fusion, DEAE dextran, calcium phosphate 
precipitation, lipofection (lysosome fusion), use of a gene gun, or a DNA vector 
25 transporter (see, e.g., Wu et al, 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 

1988, J. Biol. Chem. 263:14621-14624; Hartmut et al, Canadian Patent Application 
No. 2,012,311, filed March 15, 1990). 

The modes of administration of the vaccine or compositions of the present invention 
30 may comprise the use of any suitable means and/or methods for delivering the vaccine 
or composition to the host animal whereby they are immumostimulatively effective. 
Delivery modes may include, without limitation, parenteral administration* methods, 
such as paracancerally, transmucosally, transdermally, intramuscularly, intravenously, 
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intradermally, subcutaneously, intraperitonealy, intraventricularly, intracranially and 
intratumorally. Preferably, since the desired result of vaccination is to elucidate an 
immune response to the antigen, and thereby to the pathogenic organism, 
administration directly, or by targeting or choice of a viral vector, indirectly, to 
5 lymphoid tissues, e.g., lymph nodes or spleen, is desirable. Since immune cells are 
continually replicating, they are ideal target for retroviral vector-based'nucleic acid 
vaccines, since retroviruses require replicating cells. These vaccines and compositions 
can be used to immunize mammals, for example, by the intramuscular or parenteral 
routes, or by delivery to mucosal surfaces using microp articles, capsules, liposomes 

10 and targeting molecules, such as toxins and antibodies. The vaccines and 

immunogenic compositions may be administered to mucosal surfaces by, for example, 
the nasal or oral (intragastric) routes. Alternatively, other modes of administration 
including suppositories may be desirable. For suppositories, binders and carriers may 
include, for example, polyalkylene glycols and triglycerides. Oral formulations may 

15 include normally employed incipients, such as pharmaceutical grades of saccharine, 
cellulose and magnesium carbonate. 

These compositions may take the form of solutions, suspensions, tablets, pills, 
capsules, sustained release formulations or powders and contain 1 to 95% of the 

20 immunogenic compositions of the present invention. The immunogenic compositions 
are administered in a manner compatible with the dosage formulation, and in such 
amount as to be therapeutically effective, protective and immunogenic. The quantity to 
be administered depends on the subject to the immunized, including, for example, the 
capacity of the subject's immune system to synthesize antibodies, and if needed, to 

25 produce a cell-mediated, humoral or antibody-mediated immune response. Precise 
amounts of antigen and immunogenic composition to be administered depend on the 
judgement of the practitioner. However, suitable dosage ranges are readily 
determinable by those skilled in the art and may be of the order of micrograms to 
milligrams. Suitable regimes for initial administration and booster doses are also 

30 variable, but may include an initial administration followed by subsequent 
administrations. The dosage of the vaccine may also depend on the route of 
administration and will vary according to the size of the host. 
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Passive immunity can be conferred to an animal subject suspected of suffering an 
infection with streptococci by administering antiserum, polyclonal antibodies, or a 
neutralizing monoclonal antibody against one or more Ema polypeptide of the 
invention to the patient. A combination of antibodies directed against one or more 
5 Ema polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, 
in combination with one or more of antibodies against Spbl, Spb2, Rib and C protein 
alpha antigen is also contemplated by the present invention. Although passive 
immunity does not confer long term protection, it can be a valuable tool for the 
treatment of a bacterial infection in a subject who has not been vaccinated. Passive 

10 immunity is particularly important for the treatment of antibiotic resistant strains of 
bacteria, since no other therapy may be available. Preferably, the antibodies 
administered for passive immune therapy are autologous antibodies. For example, if 
the subject is a human, preferably the antibodies are of human origin or have been 
"humanized," in order to minimize the possibility of an immune response against the 

15 antibodies. The active or passive vaccines of the invention can be used to protect an 
animal subject from infection by streptococcus, particularly Group B streptococcus. 

Vaccines for GBS have been previously generated and tested. Preliminary vaccines 
used unconjuated purified polysaccaride. GBS polysaccharides and oligosaccharides 

20 are poorly immunogenic and fail to elicit significant memory and booster responses. 
Baker et al immunized 40 pregnant women with purified serotype III capsular 
polysaccharide (Baker, CJ. et al. (1998) New EnglJ of Med 319:1 180-1 185). 
Overall, only 57% of women with low levels of specific antibody responded to the the 
vaccine. The poor immunogenicity of purified polysaccharide antigen was further 

25 demonstrated in a study in which thirty adult volunteers were immunized with a 

tetravalent vaccine composed of purified polysaccharide from serotypes la, lb, II, and 
III (Kotloff, K.L. et al. (1996) Vaccine 14:446-450). Although safe, this vaccine was 
only modestly immunogenic, with only 13% of subjects responding to type lb, 17% to 
type II, 33% responding to type la, and 70%responding to type III polysaccharide. 

30 The poor immunogenicity of polysaccaride antigens prompted efforts to develop 

polysaccharide conjugate vaccines, whereby these polysaccharides or oligosaccharides 
are conjugated to protein carriers. Ninety percent of healthy adult women immunized 
with a type III polysaccharide-tetanus toxoid conjugate vaccine responded with a 
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4-fold rise in antibody concentration, compared to 50% immunized with plain 
polysaccharide (Kasper, D.L. et al (1996) J of Clin Invest 98:2308-23 14). A type 
Ia/Ib polysaccharide-tetanus toxoid conjugate vaccine was similarly more 
immunogenic in healthy adults than plain polysaccharide (Baker, C.J. et al (1999) J 
5 Infect Dis 119:142-150). 



The general method for the conjugation of polysaccharide is described in Wessels et al 
(Wessels, M R. et al (1990) J. Clin Investigation 86: 1428-1433). Prior to coupling 
with tetanus toxoid, aldehyde groups are introduced on the polysaccharide by 

10 controlled periodate oxidation, resulting in the conversion of a portion of the sialic 
acid residues of the polysaccharide to residues of the 8-carbon analogue of sialic acid, 
5-acetamido-3,5-dideoxy-D-galactosyloctulosonic acid. Tetanus toxoid is conjugated 
to the polysaccharide by reductive amination using free aldehyde groups present on the 
partially oxidized sialic acid residues. The preparation and conjugation of 

15 oligosaccharides is described in Paoletti et al (Paoletti, L.C. et al (1 990) J. Biol Chem 
265: 18278-18283). Purified capsular polysaccharide is depolymerized by enzymatic 
digestion using endo-beta-galactosidase produced by Citrobacter freundii. Following 
digestion, oligosaccharides are fractionated by gel filtration chromatography. Tetanus 
toxoid was covalently coupled via a synthetic spacer molecule to the reducing end of 

20 the oligosaccharide by reductive amination. 

Methods and vaccines comprising GBS conjugate vaccines, comprising capsular 
polysaccharide and protein are provided and described in U.S. Patent 5, 993,825, 
5,843,461, 5,795,580, 5,302,386 and 4,356,263, which are incorporated herein by 
25 reference in their entirety. These conjugate vaccines include polysaccharide-tetanus 
toxoid conjugate vaccines. 

One polypeptide proposed to be utilized in a GBS vaccine is the repetitive GBS 
C protein alpha antigen, which contains up to nine tandemly repeated units of 82 
30 amino acids (Michel, J.K. et al (1992) PNAS USA 89: 10060-10064). The 

polypeptide, methods and vaccines thereof, including polysaccharide-conjugate 
vaccines generated therewith, are provided and described in U.S. Patent 5,968,521, 
5,908,629, 5,858,362, 5,847,081, 5,843,461, 5,843,444, 5,820,860, and 5,648,241, 
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which are herein incorporated by reference in their entirety. Antibodies generated 
against C protein alpha antigen with a large numbers of repeats protect against 
infection, but GBS are able to change the structure of the protein by deleting one or 
more of the repeat regions and escape detection by these antibodies (Madoff, L.C. et 
5 al (1996) PNAS USA 93: 4131-4136). This effect could theoretically be prevented by 
immunization with a protein with a lower number of repeat units, but the 
immunogenicity of the C protein alpha antigen is inversely related to the number of 
repeats - 65% of mice responded to immunization with the 9-repeat protein, but only 
1 1% to a 1 -repeat protein (Gravekamp, C. et al (1997) Infect Immunity 65: 5216- 
10 5221). This is a disadvantage with <my protein with a repetitive structure - it is 

common for bacteria to be able to alter or reassort these genes to alter the proteins 
exposed on their surface. 

Typical doses for a vaccine composed of a protein antigen are in the range of 2.5-50 
15 ug of total protein per dose. Typical doses for a polysaccharide-protein conjugate 
vaccine are 7.5-25 ug of polysaccharide and 1.25-250 ug of carrier protein. These 
types of vaccines are almost always given intramuscularly. Dosing schedules of a 
vaccine can be readily determined by the skilled artisan, particularly by comparison of 
similar vaccines, including other GBS vaccines. If used as a universal vaccine, a GBS 
20 vaccine would be integrated into the routine immunization schedule. Most similar 
vaccines require a primary series of immunizations (usually 2 or 3 doses at 2 month 
intervals beginning at lor 2 months of age) and a single booster at 12-18 months of 
age. A smaller number of doses or a single dose may be adequate in older children 
(over a year of age). For immunization of pregnant women, an exemplary 
25 immunization schedule would be a single dose given in the second or early third 

trimester. For immunization of non-pregnant adults, a single dose would probably be 
used. The requirement for subsequent booster doses in adults is difficult to predict - 
this would be based on the immunogenicity of the vaccine and ongoing surveillance of 
vaccine efficacy. 

30 

Immunogenic Compositions 



WO 02/12294 



PCT/US01/24795 



72 

In a further aspect, the present invention provides an immunogenic composition 
comprising one of more bacterial Ema polypeptides. In a still further aspect, the 
present invention provides an immunogenic composition comprising one of more 
streptococcal Ema polypeptides. In a particular aspect, the present invention provides 
5 an immunogenic composition comprising one of more Group B streptococcal 

polypeptides selected from the group of EmaA, EmaB, EmaC, EmaD, EmaE and a 
fragment thereof, and a pharmaceutically accpetable adjuvant. Immunogenic 
compositions may comprise a combination of one or more Group B Ema polypeptide, 
or an immunogenic polypeptide fragment thereof, with one or more additional GBS 
10 polypeptide or GBS capsular polysaccharide or oligosaccharide. 

The present invention further provides an immunogenic composition comprising one 
or more Group B streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE, further comprising one or more antigens selected from the 
15 group of the polypeptide Spbl or an immunogenic fragment thereof, the polypeptide 
Spb2 or an immunogenic fragment thereof, C protein alpha antigen or an immunogenic 
fragment thereof, Rib or an immunogenic fragment thereof, and Group B 
streptococcal polysaccharides or oligosaccharides. 

20 Pharmaceutical Compositions 

The invention provides pharmaceutical compositions comprising a bacterial Ema 
polypeptide, particularly a streptococcal Ema polypeptide, and a pharmaceutically 
acceptable carrier. The invention provides pharmaceutical compositions comprising a 

25 Group B streptococcal polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable carrier. The present invention 
further provides pharmaceutical compositions comprising one or more GBS Ema 
polypeptide, or a fragment thereof, in combination with one or more of GBS 
polypeptide Spbl, Spb2, C protein alpha antigen, Rib, a Group B streptococcal 

30 polysaccharide or oligosaccharide vaccine, and an anti-streptococcal vaccine. 

Such pharmaceutical composition for preventing streptococcal attachment to mucosal 
surface may include antibody to Ema polypeptide EmaA, EmaB, EmaC, EmaD or 
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EmaE or any combination of antibodies to one or more such Ema polypeptide. In 
addition, any such composition may further include antibody to GBS polypeptides 
Spbl, Spb2, C protein alpha antigen, or Rib. Blocking adherence using such antibody 
blocks the initial step in infection thereby reducing colonization. This in turn decreases 
5 person to person transmission and prevents development of symptomatic disease. 

The present invention provides a pharmaceutical composition comprising an antibody 
to a Group B streptococcal protein selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable carrier. The invention further 
10 provides a pharmaceutical composition comprising a combination of at least two 
antibodies to Group B streptococcal proteins and a pharmaceutically acceptable 
carrier, wherein at least one antibody to a protein selected from the group of EmaA, 
EmaB, EmaC, EmaD, EmaE,is combined with at least one antibody to a protein 
selected from the group of Spbl, Spb2, Rib, and C protein alpha antigen. 

15 

It is still a further object of the present invention to provide a method for the 
prevention or treatment of mammals to control the amount or activity of streptococci, 
so as to treat or prevent the adverse consequences of invasive, spontaneous, or 
idiopathic pathological states. 

20 

It is still a further object of the present invention to provide a method for the 
prevention or treatment of mammals to control the amount or activity of Group B 
streptococci, so as to treat or prevent the adverse consequences of invasive, 
spontaneous, or idiopathic pathological states. 

25 

The invention provides a method for preventing infection with a bacterium that 
expresses a streptococcal Ema polypeptide comprising administering an 
immunogenically effective dose of a vaccine comprising an Ema polypeptide selected 
from the group of EmaA, EmaB, EmaC, EmaD and EmaE to a subject. 

30 

The invention further provides a method for preventing infection with a bacterium that 
expresses a Group B streptococcal Ema polypeptide comprising administering an 
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immunogenically effective dose of a vaccine comprising an Ema polypeptide selected 
from the group of Ema A, EmaB, EmaC, EmaD and EmaE to a subject. 

The present invention is directed to a method for treating infection with a bacterium 
5 that expresses a Group B streptococcal Ema polypeptide comprising administering a 
therapeutically effective dose of a pharmaceutical composition comprising an Ema 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and a 
. pharmaceutically acceptable carrier to a subject. 

10 The invention further provides a method for treating infection with a bacterium that 
expresses a Group B streptococcal Ema polypeptide comprising administering a 
therapeutically effective dose of a pharmaceutical composition comprising an antibody 
to an Ema polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, and a pharmaceutically acceptable carrier to a subject. 

15 

In a further aspect, the invention provides a method of inducing an immune response 
in a subject which has been exposed to or infected with a Group B streptococcal 
bacterium comprising administering to the subject an amount of the pharmaceutical 
composition comprising an Ema polypeptide selected from the group of EmaA, EmaB, 
20 EmaC, EmaD and EmaE, and a pharmaceutically acceptable carrier, thereby inducing 
an immune response. 

The invention still further provides a method for preventing infection by a 
streptococcal bacterium in a subject comprising administering to the subject an amount 
25 of a pharmaceutical composition comprising an antibody to an Ema polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE and a 
pharmaceutically acceptable carrier or diluent, thereby preventing infection by a 
streptococcal bacterium. 

30 The invention further provides an ema mutant bacteria which is non-adherent and/or 
non-invasive to cells and which is mutated in one or more genes selected from the 
group of emaA, emaB, emaC, emaD and emaE. Particularly, such ema mutant is a 
Group B streptococcal bacteria. Such non-adherent and/or non-invasive ema mutant 
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bacteria can further be utilized in expressing other immunogenic or therapeutic 
proteins for the purposes of eliciting immune responses to any such other proteins in 
the context of vaccines and in other forms of therapy. 

5 This invention provides a method of inhibiting colonization of host cells in a subject 
which has been exposed to or infected with a streptococcal bacterium comprising 
administering to the subject an amount of a pharmaceutical composition comprising an 
Ema polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, 
thereby inducing an immune response. The therapeutic peptide that blocks 
10 colonization is delivered by the respiratory mucosal. The pharmaceutical composition 
comprises the polypeptide selected from the group of SEQ ID NO: 2, 4, 6, 8 and 10. 

As used herein, "pharmaceutical composition" could mean therapeutically effective 
amounts of polypeptide products or antibodies of the invention together with suitable 

15 diluents, preservatives, solubilizers, emulsifiers, adjuvant and/or carriers useful in 
therapy against bacterial infection or in inducing an immune response. A 
"therapeutically effective amount" as used herein refers to that amount which provides 
a therapeutic effect for a given condition and administration regimen. Such 
compositions are liquids or lyophilized or otherwise dried formulations and include 

20 diluents of various buffer content (e.g., Tris-HCl., acetate, phosphate), pH and ionic 
strength, additives such as albumin or gelatin to prevent absorption to surfaces, 
detergents (e.g., Tween 20, Tween 80, Pluronic F68, bile acid salts), solubilizing 
agents (e.g., glycerol, polyethylene glycerol), anti-oxidants (e.g., ascorbic acid, sodium 
metabisulfite), preservatives (e.g., Thimerosal, benzyl alcohol, parabens), bulking 

25 substances or tonicity modifiers (e.g., lactose, mannitol), covalent attachment of 

polymers such as polyethylene glycol to the protein, complexation with metal ions, or 
incorporation of the material into or onto particulate preparations of polymeric 
compounds such as polylactic acid, polglycolic acid, hydrogels, etc, or onto liposomes, 
microemulsions, micelles, unilamellar or multilamellar vesicles, erythrocyte ghosts, or 

30 spheroplasts. Such compositions will influence the physical state, solubility, stability, 
rate of in vivo release, and rate of in vivo clearance of the polypeptides of the present 
invention. The choice of compositions will depend on the physical and chemical 
properties of the polypeptide. Controlled or sustained release compositions include 
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formulation in lipophilic depots (e.g., fatty acids, waxes, oils). Also comprehended by 
the invention are particulate compositions coated with polymers (e.g., poloxamers or 
poloxamines) and the polypeptides of the present invention coupled to antibodies 
directed against tissue-specific receptors, ligands or antigens or coupled to ligands of 
5 tissue-specific receptors. Other embodiments of the compositions of the invention 
incorporate particulate forms, protective coatings, protease inhibitors or permeation 
enhancers for various routes of administration, including parenteral, pulmonary, nasal 
and oral. 

10 Further, as used herein "pharmaceutical^ acceptable carrier" are well known to those 
skilled in the art and include, but are not limited to, 0.01-0. 1M and preferably 0.0 5M 
phosphate buffer or 0.8% saline. Additionally, such pharmaceutically acceptable 
carriers may be aqueous or non-aqueous solutions, suspensions, and emulsions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 

15 vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. 
Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. Parenteral vehicles include sodium 
chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's or 
fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte 

20 replenishers such as those based on Ringer's dextrose, and the like. Preservatives and 
other additives may also be present, such as, for example, antimicrobials, antioxidants, 
collating agents, inert gases and the like. 

* 

The phrase "pharmaceutically acceptable" refers to molecular entities and 
25 compositions that are physiologically tolerable and do not typically produce an allergic 
or similar untoward reaction, such as gastric upset, dizziness and the like, when 
administered to a human. 

« 

The phrase "therapeutically effective amount" is used herein to mean an amount 
30 sufficient to prevent, and preferably reduce by at least about 30 percent, more 

preferably by at least 50 percent, most preferably by at least 90 percent, a clinically 
significant infection by streptococcal bacterium. Alternatively, in the case* of a vaccine 
or immunogenic composition, a therapeutically effective amount is used herein to 
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mean an amount sufficient and suitable to elicit an immune response and antibody 
response in an individual, and particularly to provide a response sufficient to prevent, 
and preferably reduce by at least about 30 percent, more preferably by at least 50 
percent, most preferably by at least 90 percent, a clinically significant infection by 
5 streptococcal bacterium. 

Controlled or sustained release compositions include formulation in lipophilic depots 
(e.g. fatty acids, waxes, oils). Also comprehended by the invention are particulate 
compositions coated with polymers (e.g. poloxamers or poloxamines) and the 
10 compound coupled to antibodies directed against tissue-specific receptors, ligands or 
antigens or coupled to ligands of tissue-specific receptors. Other embodiments of the 
compositions of the invention incorporate particulate forms protective coatings, 
protease inhibitors or permeation enhancers for various routes of administration, 
including parenteral, pulmonary, nasal and oral. 

15 

When administered, compounds are often cleared rapidly from mucosal surfaces or 
the circulation and may therefore elicit relatively short-lived pharmacological activity. 
Consequently, frequent administrations of relatively large doses of bioactive 
compounds may by required to sustain therapeutic efficacy. Compounds modified by 

20 the covalent attachment of water-soluble polymers such as polyethylene glycol, 

copolymers of polyethylene glycol and polypropylene glycol, carboxymethyl cellulose, 
dextran, polyvinyl alcohol, polyvinylpyrrolidone or polyproline are known to exhibit 
substantially longer half-lives in blood following intravenous injection than do the 
corresponding unmodified compounds (Abuchowski et al., 1981; Newmark et al., 

25 1982; and Katre et al., 1987). Such modifications may also increase the compound's 
solubility in aqueous solution, eliminate aggregation, enhance the physical and 
chemical stability of the compound, and greatly reduce the immunogenicity and 
reactivity of the compound. As a result, the desired in vivo biological activity may be 
achieved by the administration of such polymer-compound abducts less frequently or 

30 in lower doses than with the unmodified compound. 
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Dosages. The sufficient amount may include but is not limited to from about 1 ug/kg 
to about 1000 mg/kg. The amount may be 10 mg/kg. The pharmaceutically 
acceptable form of the composition includes a pharmaceutically acceptable carrier. 

5 As noted above, the present invention provides therapeutic compositions comprising 
pharmaceutical compositions comprising vectors, vaccines, polypeptides, nucleic acids 
and antibodies, anti-antibodies, and agents, to compete with the Group B 
streptococcus bacterium for pathogenic activities, such as adherence to host cells. 

10 The preparation of therapeutic compositions which contain an active component is 
well understood in the art. Typically, such compositions are prepared as an aerosol of 
the polypeptide delivered to the nasopharynx or as injectables, either as liquid 
solutions or suspensions, however, solid forms suitable for solution in, or suspension 
in, liquid prior to injection can also be prepared. The preparation can also be 

15 emulsified. The active therapeutic ingredient is often mixed with excipients which are 
pharmaceutically acceptable and compatible with the active ingredient. Suitable 
excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like and 
combinations thereof. In addition, if desired, the composition can contain minor 
amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering 

20 agents which enhance the effectiveness of the active ingredient. 

An active component can be formulated into the therapeutic composition as 
neutralized pharmaceutically acceptable salt forms. Pharmaceutically acceptable salts 
include the acid addition salts (formed with the free amino groups of the polypeptide 

25 or antibody molecule) and which are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, 
mandelic, and the like. Salts formed from the free carboxyl groups can also be derived 
from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or 
ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2- 

30 ethylamino ethanol, histidine, procaine, and the like. 

A composition comprising "A" (where "A" is a single protein, DNA molecule, vector, 
etc.) is substantially free of "B" (where "B" comprises one or more contaminating 
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proteins, DNA molecules, vectors, etc.) when at least about 75% by weight of the 
proteins, DNA, vectors (depending on the category of species to which A and B 
belong) in the composition is "A". Preferably, "A" comprises at least about 90% by 
weight of the A-HB species in the composition, most preferably at least about 99% by 
5 weight. 

The phrase "therapeutically effective amount" is used herein to mean an amount 
sufficient to reduce by at least about 15 percent, preferably by at least 50 percent, 
more preferably by at least 90 percent, and most preferably prevent, a clinically 

10 significant deficit in the activity, function and response of the host. Alternatively, a 
therapeutically effective amount is sufficient to cause an improvement in a clinically 
significant condition in the host. In the context of the present invention, a deficit in the 
response of the host is evidenced by continuing or spreading bacterial infection. An 
improvement in a clinically significant condition in the host includes a decrease in 

15 bacterial load, clearance of bacteria from colonized host cells, reduction in fever or 
inflammation associated with infection, or a reduction in any symptom associated with 
the bacterial infection. 

According to the invention, the component or components of a therapeutic 
20 composition of the invention may be introduced parenterally, transmucosally, e.g., 

orally, nasally, pulmonarailly, or rectally, or transdermally. Preferably, administration 
is parenteral, e.g., via intravenous injection, and also including, but is not limited to, 
intra-arteriole, intramuscular, intradermal, subcutaneous, intraperitoneal, 
intraventricular, and intracranial administration. Oral or pulmonary delivery may be 
25 preferred to activate mucosal immunity; since Group B streptococci generally colonize 
the nasopharyngeal and pulmonary mucosa, particularly that of neonates, mucosal 
immunity may be a particularly effective preventive treatment. The term "unit dose" 
when used in reference to a therapeutic composition of the present invention refers to 
physically discrete units suitable as unitary dosage for humans, each unit containing a 
30 predetermined quantity of active material calculated to produce the desired therapeutic 
effect in association with the required diluent; i.e., carrier, or vehicle. 
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In another embodiment, the active compound can be delivered in a vesicle, in 
particular a liposome (see Langer, Science 249:1527-1533 (1990); Treat et aL, in 
Liposomes in the Therapy of Infectious Disease and Cancer, Lopez-Berestein and 
Fidler (eds.), Liss, New York, pp. 353-365 (1989); Lopez-Berestein, ibid., pp. 317- 
5 327; see generally ibid). 

In yet another embodiment, the therapeutic compound can be delivered in a controlled 
release system. For example, the polypeptide may be administered using intravenous 
infusion, an implantable osmotic pump, a transdermal patch, liposomes, or other 

10 modes of administration. In one embodiment, a pump may be used (see Langer, supra; 
Sefton, CRC Crit Ref Biomed Eng. 14:201 (1987); Buchwald et aL, Surgery 88:507 
(1980); Saudek et aL, N. Engl. J. Med. 321:574 (1989)). In another embodiment, 
polymeric materials can be used (see Medical Applications of Controlled Release, 
Langer and Wise (eds.), CRC Pres., Boca Raton, Florida (1974); Controlled Drug 

15 Bioavailability, Drug Product Design and Performance, Smolen and Ball (eds.), 
Wiley, New York (1984); Ranger and Peppas, J. Macromol. Set Rev. Macromol. 
Chern. 23:61 (1983); see also Levy et aL, Science 228:190 (1985); During et aL, Ann. 
Neurol. 25:351 (1989); Howard et aL, J. Neurosurg. 71:105 (1989)). In yet another 
embodiment, a controlled release system can be placed in proximity of the therapeutic 

20 target, i.e., the brain, thus requiring only a fraction of the systemic dose (see, e.g., 
Goodson, in Medical Applications of Controlled Release, supra, vol. 2, pp. 115-138 
(1984)). Preferably, a controlled release device is introduced into a subject in 
proximity of the site of inappropriate immune activation or a tumor. Other controlled 
release systems are discussed in the review by Langer (Science 249:1527-1533 

25 (1990)). 

A subject in whom administration of an active component as set forth above is an 
effective therapeutic regimen for a bacterial infection is preferably a human, but can be 
any animal. Thus, as can be readily appreciated by one of ordinary skill in the art, the 
30 methods and pharmaceutical compositions of the present invention are particularly 

suited to administration to any animal, particularly a mammal, and including, but by no 
means limited to, domestic animals, such as feline or canine subjects, farm animals, 
such as but not limited to bovine, equine, caprine, ovine, and porcine subjects, wild 
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animals (whether in the wild or in a zoological garden), research animals, such as mice, 
rats, rabbits, goats, sheep, pigs, dogs, cats, etc., i.e., for veterinary medical use. 

In the therapeutic methods and compositions of the invention, a therapeutically 
5 effective dosage of the active component is provided. A therapeutically effective 
dosage can be determined by the ordinary skilled medical worker based on patient 
characteristics (age, weight, sex, condition, complications, other diseases, etc.), as is 
well known in the art. Furthermore, as further routine studies are conducted, more 
specific information will emerge regarding appropriate dosage levels for treatment of 

10 various conditions in various patients, and the ordinary skilled worker, considering the 
therapeutic context, age and general health of the recipient, is able to ascertain proper 
dosing. Generally, for intravenous injection or infusion, dosage may be lower than for 
intraperitoneal, intramuscular, or other route of administration. The dosing schedule 
may vary, depending on the circulation half-life, and the formulation used. The 

15 compositions are administered in a manner compatible with the dosage formulation in 
the therapeutically effective amount. Precise amounts of active ingredient required to 
be administered depend on the judgment of the practitioner and are peculiar to each 
individual. However, suitable dosages may range from about 0. 1 to 20, preferably 
about 0.5 to about 10, and more preferably one to several, milligrams of active 

20 ingredient per kilogram body weight of individual per day and depend on the route of 
administration. Suitable regimes for initial administration and booster shots are also 
variable, but are typified by an initial administration followed by repeated doses at one 
or more hour intervals by a subsequent injection or other administration. 
Alternatively, continuous intravenous infusion sufficient to maintain concentrations of 

25 ten nanomolar to ten micromolar in the blood are contemplated. 

Administration with other compounds. For treatment of a bacterial infection, one may 
administer the present active component in conjunction with one or more 
pharmaceutical compositions used for treating bacterial infection, including but not 
30 limited to (1) antibiotics; (2) soluble carbohydrate inhibitors of bacterial adhesin; (3) 
other small molecule inhibitors of bacterial adhesin; (4) inhibitors of bacterial 
metabolism, transport, or transformation; (5) stimulators of bacterial lysis,«or (6) anti- 
bacterial antibodies or vaccines directed at other bacterial antigens. Other potential 
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active components include anti-inflammatory agents, such as steroids and non- 
steroidal anti-inflammatory drugs. Administration may be simultaneous (for example, 
administration of a mixture of the present active component and an antibiotic), or may 
be in seriatim. 

5 

Accordingly, in specific embodiment, the therapeutic compositions may further include 
an effective amount of the active component, and one or more of the following active 
ingredients: an antibiotic, a steroid, etc. 

* 

10 Thus, in a specific instance where it is desired to reduce or inhibit the infection 

resulting from a bacterium mediated binding of bacteria to a host cell, or an antibody 
thereto, or a ligand thereof or an antibody to that ligand, the polypeptide is introduced 
to block the interaction of the bacteria with the host cell. 

15 Also contemplated herein is pulmonary delivery of an inhibitor of the polypeptide of 
the present invention having which acts as adhesin inhibitory agent (or derivatives 
thereof). The adhesin inhibitory agent (or derivative) is delivered to the lungs of a 
mammal, where it can interfere with bacterial, i.e., streptococcal, and preferably Group 
B streptococcal binding to host cells. Other reports of preparation of proteins for 

20 pulmonary delivery are found in the art [Adjei et al. (1 990) Pharmaceutical Research, 
7:565-569; Adjei et a/.(1990) International Journal of Pharmaceutics, 63:135-144 
(leuprolide acetate); Braquet et al (1989), Journal of Cardiovascular Pharmacology, 
13(suppl. 5): 143-146 (endothelin-1); Hubbard et a/.(1989) Annals of Internal 
Medicine, Vol. HI, pp. 206-212 (a 1 -antitrypsin); Smiths al (1989) J. Clin. Invest. 

25 84: 1 145-1 146 (a- 1 -proteinase); Oswein et al., "Aerosolization of Proteins", 

Proceedings of Symposium on Respiratory Drug Delivery II, Keystone, Colorado, 
March, (1990) (recombinant human growth hormone); Debs et al. (1988) J. Immunol. 
140:3482-3488 (interferon-y and tumor necrosis factor alpha); Platz et al, U.S. Patent 
No. 5,284,656 (granulocyte colony stimulating factor)]. A method and composition 

30 for pulmonary delivery of drugs is described in U.S. Patent No. 5,45 1,569, issued 
September 19, 1995 to Wong et al. 
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All such devices require the use of formulations suitable for the dispensing of adhesin 
inhibitory agent (or derivative). Typically, each formulation is specific to the type of 
device employed and may involve the use of an appropriate propellant material, in 
addition to the usual diluents, adjuvant and/or carriers useful in therapy. Also, the use 
5 of liposomes, microcapsules or microspheres, inclusion complexes, or other types of 
carriers is contemplated. Chemically modified adhesin inhibitory agent may also be 
prepared in different formulations depending on the type of chemical modification or 
the type of device employed. 

10 Formulations suitable for use with a.nebulizer, either jet or ultrasonic, will typically 
comprise adhesin inhibitory agent (or derivative) dissolved in water at a concentration 
of about 0. 1 to 25 mg of biologically active adhesin inhibitory agent per ml of solution. 
The formulation may also include a buffer and a simple sugar (e.g., for adhesin 
inhibitory agent stabilization and regulation of osmotic pressure). The nebulizer 

15 formulation may also contain a surfactant, to reduce or prevent surface induced 

aggregation of the adhesin inhibitory agent caused by atomization of the solution in 
forming the aerosol. 

Formulations for use with a metered-dose inhaler device will generally comprise a 
20 finely divided powder containing the adhesin inhibitory agent (or derivative) suspended 
in a propellant with the aid of a surfactant. The propellant may be any conventional 
material employed for this purpose, such as a chlorofluoro carbon, a 
hydrochlorofluorocarbon, a hydrofluorocarbon, or a hydrocarbon, including 
trichlorofluoromethane, dichlorodifluoromethane, dichlorotetrafluoroethanol, and 
25 1,1,1 ,2-tetrafluoroethane, or combinations thereof. Suitable surfactants include 
sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a surfactant. 

The liquid aerosol formulations contain adhesin inhibitory agent and a dispersing agent 
in a physiologically acceptable diluent. The dry powder aerosol formulations of the 
30 present invention consist of a finely divided solid form of adhesin inhibitory agent and 
a dispersing agent. With either the liquid or dry powder aerosol formulation, the 
formulation must be aerosolized. That is, it must be broken down into liquid or solid 
particles in order to ensure that the aerosolized dose actually reaches the mucous 
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membranes of the nasal passages or the lung. The term "aerosol particle" is used 
herein to describe the liquid or solid particle suitable for nasal or pulmonary 
administration, Le. 9 that will reach the mucous membranes. Other considerations, such 
as construction of the delivery device, additional components in the formulation, and 
5 particle characteristics are important. These aspects of pulmonary administration of a 
drug are well known in the art, and manipulation of formulations, aerosolization means 
and construction of a delivery device require at most routine experimentation by one 
of ordinary skill in the art. In a particular embodiment, the mass median dynamic 
diameter will be 5 micrometers or less in order to ensure that the drug particles reach 
10 the lung alveoli [Wearley, L.L. (1991) Crit Rev. in Ther. Drug Carrier Systems 
8:333]. 

Systems of aerosol delivery, such as the pressurized metered dose inhaler and the dry 
powder inhaler are disclosed in Newman, S.P., Aerosols and the Lung, Clarke, S.W. 
15 and Davia, D. editors, pp. 197-22 and can be used in connection with the present 
invention. 

■ 

In a further embodiment, as discussed in detail infra, an aerosol formulation of the 
present invention can include other therapeutically or pharmacologically active 
20 ingredients in addition to adhesin inhibitory agent, such as but not limited to an 
antibiotic, a steroid, a non-steroidal anti-inflammatory drug, etc. 

Liquid Aerosol Formulations. The present invention provides aerosol formulations 
and dosage forms for use in treating subjects suffering from bacterial, e.g., 

25 streptococcal, in particularly streptococcal, infection. In general such dosage forms 
contain adhesin inhibitory agent in a pharmaceutically acceptable diluent. 
Pharmaceutically acceptable diluents include but are not limited to sterile water, saline, 
buffered saline, dextrose solution, and the like. In a specific embodiment, a diluent 
that may be used in the present invention or the pharmaceutical formulation of the 

30 present invention is phosphate buffered saline, or a buffered saline solution generally 
between the pH 7.0-8.0 range, or water. 



« 
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The liquid aerosol formulation of the present invention may include, as optional 
ingredients, pharmaceutically acceptable carriers, diluents, solubilizing or emulsifying 
agents, surfactants and excipients. The formulation may include a carrier. The carrier 
is a macromolecule which is soluble in the circulatory system and which is 
5 physiologically acceptable where physiological acceptance means that those of skill in 
the art would accept injection of said carrier into a patient as part of a therapeutic 
regime. The carrier preferably is relatively stable in the circulatory system with an 
acceptable plasma half life for clearance. Such macromolecules include but are not 
limited to Soya lecithin, oleic acid and sorbitan trioleate, with sorbitan trioleate 
10 preferred. 

The formulations of the present embodiment may also include other agents useful for 
pH maintenance, solution stabilization, or for the regulation of osmotic pressure. 
Examples of the agents include but are not limited to salts, such as sodium chloride, or 
15 potassium chloride, and carbohydrates, such as glucose, galactose or mannose, and the 
like. 

The present invention further contemplates liquid aerosol formulations comprising 
adhesin inhibitory agent and another therapeutically effective drug, such as an 
20 antibiotic, a steroid, a non-steroidal anti-inflammatory drug, etc. 

Aerosol Dry Powder Formulations. It is also contemplated that the present aerosol 
formulation can be prepared as a dry powder formulation comprising a finely divided 
powder form of adhesin inhibitory agent and a dispersant 

25 Formulations for dispensing from a powder inhaler device will comprise a finely 

divided dry powder containing adhesin inhibitory agent (or derivative) and may also 
include a bulking agent, such as lactose, sorbitol, sucrose, or mannitol in amounts 
which facilitate dispersal of the powder from the device, e.g., 50 to 90% by weight of 
the formulation. The adhesin inhibitory agent (or derivative) should most 

30 advantageously be prepared in particulate form with an average particle size of less 
than 10 mm (or microns), most preferably 0.5 to 5 mm, for most effective delivery to 
the distal lung. In another embodiment, the dry powder formulation can comprise a 
finely divided dry powder containing adhesin inhibitory agent, a dispersing agent and 



WO 02/12294 PCT/US01/24795 

86 

also a bulking agent. Bulking agents useful in conjunction with the present 
formulation include such agents as lactose, sorbitol, sucrose, or mannitol, in amounts 
that facilitate the dispersal of the powder from the device. 

5 The present invention further contemplates dry powder formulations comprising 
adhesin inhibitory agent and another therapeutically effective drug, such as an 
antibiotic, a steroid, a non-steroidal anti-inflammatory drug, etc. 

Contemplated for use herein are oral solid dosage forms, which are described generally 
10 in Remington's Pharmaceutical Sciences, 1 8th Ed. 1990 (Mack Publishing Co. Easton 
PA 1 8042) at Chapter 89, which is herein incorporated by reference. Solid dosage 
forms include tablets, capsules, pills, troches or lozenges, cachets or pellets. Also, 
liposomal or proteinoid encapsulation may be used to formulate the present 
compositions (as, for example, proteinoid microspheres reported in U.S. Patent 
15 No. 4,925,673). Liposomal encapsulation may be used and the liposomes may be 

derivatized with various polymers (e.g., U.S. Patent No. 5,013,556). A description of 
possible solid dosage forms for the therapeutic is given by Marshall, K. In: Modern 
Pharmaceutics Edited by G.S. Banker and C.T. Rhodes Chapter 10, 1979, herein 
incorporated by reference. In general, the formulation will include the component or 
20 components (or chemically modified forms thereof) and inert ingredients which allow 
for protection against the stomach environment, and release of the biologically active 
material in the intestine. 

Also specifically contemplated are oral dosage forms of the above derivatized 
25 component or components. The component or components may be chemically 

modified so that oral delivery of the derivative is efficacious. Generally, the chemical 
modification contemplated is the attachment of at least one moiety to the component 
molecule itself, where said moiety permits (a) inhibition of proteolysis; and (b) uptake 
into the blood stream from the stomach or intestine. Also desired is the increase in 
30 overall stability of the component or components and increase in circulation time in the 
body. Examples of such moieties include: polyethylene glycol, copolymers of 
ethylene glycol and propylene glycol, carboxymethyl cellulose, dextran, pQlyvinyl 
alcohol, polyvinyl pyrrolidone and polyproline. Abuchowski and Davis, 1981, 
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"Soluble Polymer-Enzyme Abducts" In: Enzymes as Drugs, Hocenberg and Roberts, 
eds., Wiley-Interscience, New York, NY, pp. 367-383; Newmark, et al (1982) J. 
AppL Biochem. 4:185-189. Other polymers that could be used are poly- 1 ,3 -dioxolane 
and poly-l,3,6-tioxocane. Preferred for pharmaceutical usage, as indicated above, are 
5 polyethylene glycol moieties. 

For the component (or derivative) the location of release may be the stomach, the 
small intestine (the duodenum, the jejunem, or the ileum), or the large intestine. One 
skilled in the art has available formulations which will not dissolve in the stomach, yet 
10 will release the material in the duodenum or elsewhere in the intestine. Preferably, the 
release will avoid the deleterious effects of the stomach environment, either by 
protection of the protein (or derivative) or by release of the biologically active material 
beyond the stomach environment, such as in the intestine. 

15 To ensure full gastric resistance a coating impermeable to at least pH 5.0 is essential. 
Examples of the more common inert ingredients that are used as enteric coatings are 
cellulose acetate trimellitate (CAT), hydroxypropylmethylcellulose phthalate 
(HPMCP), HPMCP 50, HPMCP 55, polyvinyl acetate phthalate (PVAP), Eudragit 
L30D, Aquateric, cellulose acetate phthalate (CAP), Eudragit L, Eudragit S, and 

20 Shellac. These coatings may be used as mixed films. 

A coating or mixture of coatings can also be used on tablets, which are not intended 
for protection against the stomach. This can include sugar coatings, or coatings which 
make the tablet easier to swallow. Capsules may consist of a hard shell (such as 
25 gelatin) for delivery of dry therapeutic i.e. powder; for liquid forms, a soft gelatin shell 
may be used. The shell material of cachets could be thick starch or other edible paper. 
For pills, lozenges, molded tablets or tablet triturates, moist massing techniques can be 
used. 

30 The peptide therapeutic can be included in the formulation as fine multiparticulates in 
the form of granules or pellets of particle size about 1mm. The formulation of the 
material for capsule administration could also be as a powder, lightly compressed 
plugs or even as tablets. The therapeutic could be prepared by compression. 
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Colorants and flavoring agents may all be included. For example, the protein (or 
derivative) may be formulated (such as by liposome or microsphere encapsulation) and 
then further contained within an edible product, such as a refrigerated beverage 
containing colorants and flavoring agents. 

5 

One may dilute or increase the volume of the therapeutic with an inert material. These 
diluents could include carbohydrates, especially mannitol, a-lactose, anhydrous lactose, 
cellulose, sucrose, modified dextran and starch. Certain inorganic salts may be also be 
used as fillers including calcium triphosphate, magnesium carbonate and sodium 
10 chloride. Some commercially available diluents are Fast-Flo, Emdex, STA-Rx 1500, 
Emcompress and Avicell. 

Disintegrants may be included in the formulation of the therapeutic into a solid dosage 
form. Materials used as disintegrates include but are not limited to starch, including 

15 the commercial disintegrant based on starch, Explotab. Sodium starch glycolate, 

Amberiite, sodium carboxymethylcellulose, ultramyiopectin, sodium alginate, gelatin, 
orange peel, acid carboxymethyl cellulose, natural sponge and bentonite may all be 
used. Another form of the disintegrants are the insoluble cationic exchange resins. 
Powdered gums may be used as disintegrants and as binders and these can include 

20 powdered gums such as agar, Karaya or tragacanth. Alginic acid and its sodium salt 
are also useful as disintegrants. Binders may be used to hold the therapeutic agent 
together to form a hard tablet and include materials from natural products such as 
acacia, tragacanth, starch and gelatin. Others include methyl cellulose (MC), ethyl 
cellulose (EC) and carboxymethyl cellulose (CMC). Polyvinyl pyrrolidone (PVP) and 

25 hydroxypropylmethyl cellulose (HPMC) could both be used in alcoholic solutions to 
granulate the therapeutic. 

An antifrictional agent may be included in the formulation of the therapeutic to prevent 
sticking during the formulation process. Lubricants may be used as a layer between 
30 the therapeutic and the die wall, and these can include but are not limited to; stearic 
acid including its magnesium and calcium salts, polytetrafluoroethylene (PTFE), liquid 
paraffin, vegetable oils and waxes. Soluble lubricants may also be used such as sodium 
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lauryl sulfate, magnesium lauryl sulfate, polyethylene glycol of various molecular 
weights, Carbowax 4000 and 6000. 

Glidants that might improve the flow properties of the drug during formulation and to 
5 aid rearrangement during compression might be added. The glidants may include 
starch, talc, pyrogenic silica and hydrated silicoaluminate. 

To aid dissolution of the therapeutic into the aqueous environment a surfactant might 
be added as a wetting agent. Surfactants may include anionic detergents such as 

10 sodium lauryl sulfate, dioctyl sodiunj sulfosuccinate and dioctyl sodium sulfonate. 
Cationic detergents might be used and could include benzalkonium chloride or 
benzethomium chloride. The list of potential nonionic detergents that could be 
included in the formulation as surfactants are lauromacrogol 400, polyoxyl 40 stearate, 
poly oxy ethylene hydrogenated castor oil 10, 50 and 60, glycerol monostearate, 

15 polysorbate 40, 60, 65 and 80, sucrose fatty acid ester, methyl cellulose and 

carboxymethyl cellulose. These surfactants could be present in the formulation of the 
protein or derivative either alone or as a mixture in different ratios. 

Additives which potentially enhance uptake of the polypeptide (or derivative) are for 
20 instance the fatty acids oleic acid, linoleic acid and linolenic acid. 

Pulmonary Delivery, Also contemplated herein is pulmonary delivery of the present 
polypeptide (or derivatives thereof). The polypeptide (or derivative) is delivered to 
the lungs of a mammal while inhaling and coats the mucosal surface of the alveoli. 

25 Other reports of this include Adjei et al (1990) Pharmaceutical Research 7:565-569; 
Adjei et al (1990) International Journal of Pharmaceutics 63 : 135-144 (leuprolide 
acetate); Braquet et a/.(1989) Journal of Cardiovascular Pharmacology, 
13(suppl. 5): 143-146 (endothelin-1); Hubbard et al. (1989) Annals of Internal 
Medicine,Vol III, pp. 206-212 (al- antitrypsin); Smith et al (1989) J. Clin. Invest. 

30 84: 1 145-1 146 (a- 1 -proteinase); Oswein et al (1990) "Aerosolization of Proteins", 
Proceedings of Symposium on Respiratory Drug Delivery II 9 Keystone, Colorado, 
March, (recombinant human growth hormone); Debs et al (1988) J. Immunol 
140:3482-3488 (interferon-g and tumor necrosis factor alpha) and Platz et al, U.S. 
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Patent No. 5,284,656 (granulocyte colony stimulating factor). A method and 
composition for pulmonary delivery of drugs for systemic effect is described in U.S. 
Patent No. 5,451,569, issued September 19, 1995 to Wong et al. 

5 Contemplated for use in the practice of this invention are a wide range of mechanical 
devices designed for pulmonary delivery of therapeutic products, including but not 
limited to nebulizers, metered dose inhalers, and powder inhalers, all of which are 

familiar to those skilled in the art. 

■« 

10 Formulations suitable for use with a nebulizer, either jet or ultrasonic, will typically 
comprise polypeptide (or derivative) dissolved in water at a concentration of about 
0. 1 to 25 mg of biologically active protein per mL of solution. The formulation may 
also include a buffer and a simple sugar (e.g., for protein stabilization and regulation of 
osmotic pressure). The nebulizer formulation may also contain a surfactant, to reduce 

15 or prevent surface induced aggregation of the protein caused by atomization of the 
solution in forming the aerosol. 

Formulations for use with a metered-dose inhaler device will generally comprise a 
finely divided powder containing the polypeptide (or derivative) suspended in a 

20 propellant with the aid of a surfactant. The propellant may be any conventional 
material employed for this purpose, such as a chlorofluorocarbon, a 
hydrochlorofluorocarbon, a hydrofluorocarbon, or a hydrocarbon, including 
trichlorofluoromethane, dichlorodifluoromethane, dichlorotetrafluoroethanol, and 
1, 1, 1,2-tetrafluoroethane, or combinations thereof. Suitable surfactants include 

25 sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a surfactant. 

Formulations for dispensing from a powder inhaler device will comprise a finely 
divided dry powder containing polypeptide (or derivative) and may also include a 
bulking agent, such as lactose, sorbitol, sucrose, or mannitol in amounts which 
30 facilitate dispersal of the powder from the device, e.g., 50 to 90% by weight of the 
formulation. The protein (or derivative) should most advantageously be prepared in 
particulate form with an average particle size of less than 10 mm (or microns), most 
preferably 0.5 to 5 mm, for most effective delivery to the distal lung. 
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Nasal Delivery. Nasal or nasopharyngeal delivery of the polypeptide (or derivative) is 
also contemplated. Nasal delivery allows the passage of the polypeptide directly over 
the upper respiratory tract mucosal after administering the therapeutic product to the 
nose, without the necessity for deposition of the product in the lung. Formulations for 
5 nasal delivery include those with dextran or cyclodextran. 

tide nomenclature, J. Biol Chem. , 243:3552-59 (1969), abbreviations for amino acid 

The therapeutic polypeptide-, analog- or active fragment-containing compositions are 
conventionally administered intravenously, as by injection of a unit dose, for example. 
10 The term "unit dose" when used in reference to a therapeutic composition of the 
present invention refers to physically discrete units suitable as unitary dosage for 
humans, each unit containing a predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with the required diluent; i.e., 
carrier, or vehicle. 

15 

The compositions are administered in a manner compatible with the dosage 
formulation, and in a therapeutically effective amount. The quantity to be 
administered depends on the subject to be treated, capacity of the subject's immune 
system to utilize the active ingredient, and degree of inhibition or neutralization of ~ 

20 binding capacity desired. Precise amounts of active ingredient required to be 

administered depend on the judgment of the practitioner and are peculiar to each 
individual. However, suitable dosages may range from about 0.1 to 20, preferably 
about 0.5 to about 10, and more preferably one to several, milligrams of active 
ingredient per kilogram body weight of individual per day and depend on the route of 

25 administration. Suitable regimes for initial administration and booster shots are also 
variable, but are typified by an initial administration followed by repeated doses at one 
or more hour intervals by a subsequent injection or other administration. 
Alternatively, continuous intravenous infusion sufficient to maintain concentrations of 
ten nanomolar to ten micromolar in the blood are contemplated. 

30 

The invention may be better understood by reference to the following non-limiting 
Examples, which are provided as exemplary of the invention. The following examples 
are presented in order to more fully illustrate the preferred embodiments of the 
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invention and should in no way be construed, however, as limiting the broad scope of 
the invention. 

EXAMPLE 1 

5 

IDENTIFICATION OF GROUP B STREPTOCOCCUS GENES 

Comparing the genetic and phenotypic composition of genetically-related groups of a 
bacterial species facilitates identifying virulence factors present in the most pathogenic 

10 groups. Type III GBS can be subdivided into three groups of related strains based on 
the analysis of restriction digest patterns (RDPs) produced by digestion of 
chromosomal DNA with Hind III and Sse 8387 (5, 6). Over 90% of invasive type III 
GBS disease in neonates in Japan and in Salt Lake City is caused by bacteria from one 
of three RDP types, termed RDP type III-3, while RDP type III-2 are significantly 

15 more likely to be isolated from vagina than from blood or CSF (6). These results 
suggest that this genetically-related cluster of type III-3 GBS are more virulent than 
III-2 strains and could be responsible for the majority of invasive type III disease 
globally. We proposed that bacterial factors that contribute to the increased virulence 
of III-3 strains can be identified by characterizing the differences between the genetic 

20 composition of III-3 and III-2 strains. Such genetic differences will be found in the 
bacterial chromosomes since these strains do not contain plasmids (6). 

To identify genes present in virulent type III-3 GBS strains and not in the avirulent 
type III-2 strains we used a modification of the technique described by Lisitsyn et al 

25 (7). High molecular weight genomic DNA from an invasive RDP type III-3 GBS strain 
(strain 874391) and a colonizing ("avirulent") RDP type III-2 strain (strain 865043) 
was prepared by cell lysis with mutanolysin and Proteinase K digestion (5). For 
genetic subtraction, genomic DNA from both strains was digested withTaq I. Taq 
I-digested DNA from the virulent strain was mixed with two complementary 

30 oligonucleotides (TaqA (5'-CTAGGTGGATCCTTCGGCAAT-3 1 (SEQ ID NO: 1 1)) 
and TaqB (5-CGATTGCCGA-3 ' (SEQ ID NO: 12)), heated to 50°C for 5 minutes, 
then allowed to cool slowly to 16°C in T4 ligase buffer. Oligonucleotides were ligated 
to the virulent strain DNA by incubation with 20 units of T4 ligase at 16°C for 12 
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hours. After ligation, 500 ng of DNA from the virulent strain, with ligated linkers, and 
40 ug of DNA from the avirulent strain, without linkers, was mixed together, 
denatured by heating, and hybridized at 68°C for 20 hours. 

5 Ten percent of the resulting hybridization mixture was incubated with Taq DNA 
polymerase and dNTPs to fill in the ends of annealed virulent strain DNA. The 
hybridized DNA was amplified by Taq DNA polymerase for 1 0 cycles using the TaqA 
oligonucleotide as the forward and reverse amplification primer. After amplification, 
single stranded products remaining after amplification were digested with mung bean 
10 nuclease. Twenty percent of the resulting product was then reamplified for 20 cycles. 
This process of subtraction followed by PGR amplification results in enhanced 
amplification of DNA segments from the III-3 strains that do not hybridize with DNA 
segments from the III-2 strains. 

15 A total of four cycles of subtraction and amplification were carried out, using 

successively smaller quantities of III-3 specific PGR products and alternating two sets 
of adaptors (TaqA/B (SEQ IDNOS: 11 and 12, respectively) and TaqE/F (TaqE (5'- 
AGGC AACTGTGCT AACCGAGGGAAT-3 1 (SEQ ID NO: 13)); and TaqF (5'- 
CGATTCCCTCG-3' (SEQ ID NO: 14)). The final amplification products were 

20 ligated into pBS KS+ vectors. Thirteen clones were randomly selected for analysis. 
These probes were used in slot and dot blot experiments to determine whether 
subtraction was successful and to identify probes hybridizing with all III-3 strains. 
Each of the 6 unique probes hybridized with the parental III-3 virulent strain, while 
none of the probes hybridized with the avirulent III-2 strains. Two of the amplified 

25 sequence tags (clones DY1-1 and DY1-1 1) hybridized with genomic DNA from all 62 
type III isolates, but did not hybridize with DNA prepared from the III-2 and III-l 
isolates (FIGURE 1). To obtain additional sequence information, we constructed a 
genomic GBS III-3 library. Multiple plaques hybridizing with each of the III-3 
GBS-specific probes have been purified for further characterization. 

30 

RESULTS 



THE spb LOCUS 



wo 
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Three overlapping genomic clones hybridizing with probe DY1-1 were identified. A 
6.4 kb Sal I-Bgl II fragment present in each clone was subcloned and sequenced. This 
genomic DNA is present in all RDP type III-3 strains but not in serotype III-2, III-l or 
other GBS serotype strains. 

5 

Over 90% of this genomic DNA fragment has been sequenced and found to contain 5 
open reading frames (ORFs). Two ORFs appear to be candidates for virulence genes, 
spbl is a 1509 bp ORF. The predicted protein (502 amino acids and Mr 53,446) has 
the characteristics of a cell-wall bound protein. The nucleic acid and predicted amino 

10 acid sequences of sbpl are provided, in SEQ ID NOS: 15 and 16, respectively. The 
N-terminus of the predicted protein is a hydrophilic, basic stretch of 6 amino acids 
followed by a 23 amino acid hydrophobic, proline-rich core, consistent with a signal 
peptide. The hydrophilic mature protein terminates in a typical LPXTG (SEQ ID NO: 
17) domain that immediately precedes a hydrophobic 20 amino acid core and a short, 

15 basic hydrophilic terminus. The nucleotide sequence is not homologous to sequences 
of other known bacterial genes. The translated amino acid sequence, however, shares 
segmental homology with a number of characterized proteins, including the fimbrial 
type 2 protein of Actinomyces naeshindii (27% identity over 350 amino acids) and the 
fimbrial type 1 protein of Actinomyces viscosus (25% homology over 420 amino 

20 acids) (16), the T6 surface protein of S. pyogenes (23% identity over 359 amino 
acids) (20), and the hsf (27% identity over 260 amino acids) and HMW1 adhesins 
(25% identity over 285 amino acids) of Haemophilus influenzae (21, 22). The 
function of the S. pyogenes T6 protein is unknown. Each of the other homologs plays 
a role in bacterial adhesion and/or invasion. 

25 

A spbl' isogenic deletion mutant GBS strain was created by homologous 
recombination (using the method as described in Example 2 below) and the ability of 
the spbl" mutant to adhere to and invade A549 respiratory epithelial cells was 
determined. Compared to the wild type strain, the number of spbV bacteria adherent 
30 to A549 monolayers was reduced by 60.0% (p<0.01) and the number of intracellular 
invading bacteria was reduced by 53.6% (p<0.01). This data suggests spbl may 
contribute to the pathogenesis of GBS pneumonia and bacterial entry into the 
bloodstream. 
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The second ORF, spb2, terminates 37 bp upstream from spbl and is in the same 
transcriptional orientation. This 1692 bp ORF has a deduced amino acid sequence of 
579 residues and Mr 64,492. The nucleic acid and predicted amino acid sequences of 
sbp2 are provided in SEQ ID NOS: 18 and 19, respectively. spb2 shares 50.5% 
5 nucleic acid identity and 20.7% amino acid identity with spbl. Conservation is 
highest in the carboxy-terminal regions, including a shared LPSTGG (SEQ ID NO: 
20) motif. In contrast to spbl, spb2 does not have a obvious signal sequence. Its 
secretion may be mediated by carboxy-terminal recognition sequences or by accessory 
peptides (23). The deduced amino acid sequence of Spb2 is also homologous with S. 
10 pyogenes T6 and Actinomyces naeslundii proteins, and to Listeria monocytogenes 
internalin A (22% identity over 308 amino acids); again, proteins important in 
adhesion and invasion (24). 

THE ema LOCUS 

15 

Two genomic clones hybridizing with probe DY1-1 1 were identified. A 7 kb Hind III 
fragment present in each clone was subcloned and sequenced. Unlike the serotype III 
specific spb sequences, this genomic DNA, which is adjacent to a region of serotype 
III-3 specific DNA, was found to be present in all GBS tested to date, including 
20 serotype la, lb, II and V strains. This region of the GBS chromosome, which we have 
designated the extracellular matrix adhesin (ema) locus, contains 5 significant ORFs. 

emaA 

The first ORF, emaA, is 738 bp long, with a predicted protein product of 246 amino 
25 acids and Mr 26.2. The nucleic acid sequence (SEQ ID NO: 1) and predicted amino 
acid sequence (SEQ ID NO: 2) of emaA are shown in FIGURE 2, The EmaA protein 
is a non-repetitive protein. The 27 amino acid N-terminus of the predicted protein is 
consistent with a signal peptide. The mature protein has an imperfect cell wall binding 
domain (XPXTGG (SEQ ID NO:21)) followed by a transmembrane spanning domain 
30 encompassing residues 219-235 and a terminal hydrophilic tail. The emaA nucleotide 
sequence is not homologous to known sequences of bacterial genes. The translated 
amino acid sequence, however, shares segmental homology with a number of 
characterized proteins, including a collagen adhesin, Bbp, of Staphylococcus aureus 
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(37% identity over 103 aa) (15), a type 2 fimbrial structural subunit of Actinomyces 
naeslandii (39% homology over 1 12 aa) (16), and the FimP protein of Actinomyces 
viscosus (28% homology over 228 aa) (17). The function of the S. pyogenes T6 
protein is unknown. The type 1 and type 2 fimbria of Actinomyces mediate bacterial 
5 adhesion to salivary glycoproteins and various host cells, contributing to the 
pathogenesis of dental caries. 

emaB 

The second ORF, emaB, begins 94 bp 3' of emaA and is in the same transcriptional 
10 orientation. The nucleic acid sequence (SEQ ID NO: 3) and predicted amino acid 

sequence (SEQ ID NO: 4) of emaB are shown in FIGURE 3. It is 924 bp long; with a 
predicted protein product of 308 amino acids and Mr 33.9. The predicted EmaB 
protein is a nonrepetitive protein. The 27 amino acids N-terminus of the predicted 
protein is consistent with a signal peptide. The mature protein has an imperfect cell 
15 wall binding domain (XPXTG) followed by a transmembrane spanning domain 

encompassing residues 279-294. The emaB nucleotide sequence is not homologous to 
known sequences of bacterial genes. The translated amino acid sequence, however, 
shares segmental homology with a number of characterized proteins, including a type 2 
fimbrial structural subunit of Actinomyces naeslandii (28% homology over 222 amino 
20 acids), the T6 protein of S. pyogenes (26% homology over 266 amino acids) (20), and 
a S. epidermidis putative cell- surface adhesin (24% identity over 197 amino acids). 
The first of these proteins mediates adhesion of S. aureus to collagen and is postulated 
to contribute to the pathogenesis of osteomyelitis and infectious arthritis. 

25 emaC 

The third ORF, emaC, begins 2 bp 3' of emaB and is the same transcriptional 
orientation. It is 918 bp long, with a predicted protein product of 305 amino acids and 
Mr 34.5. The nucleic acid sequence (SEQ ID NO: 5) and predicted amino acid 
sequence (SEQ ID NO: 6) of emaC are depicted in FIGURE 4. The EmaC protein is 
30 a nonrepetitive protein. The 30 amino acid N-terminus of the predicted protein is 
consistent with a signal peptide. The mature protein has a transmembrane spanning 
domain emcompassing residues 265 - 281. The emaC nucleotide sequence is not 
homologous to known sequences of bacterial genes. The translated amino acid 
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sequence, however, shares segmental homology with a number of characterized 
proteins, including proteins associated with the assembly of type 2 fimbrial structural 
subunit of Actinomyces naeslandii (38% homology over 234 amino acids) (16). 
These proteins are required for the assembly of type 2 fimbria. 

5 

« 

emaD 

The fourth ORF, emaD, is 852 bp long, overlaps emaC by 47 bp, and is in the same 
transcriptional orientation. The predicted protein product is 284 amino acids and Mr 
33.1. The nucleic acid sequence (SEQ ID NO: 7) and predicted amino acid sequence 

10 (SEQ ID NO: 8) of emaD are shown, in FIGURE 5. No indentifiable N-terminal signal 
sequence is present and potential transmembrane segments are present at positions 
19-35 and 252-280. The mature protein is not repetative and lacks a cell wall binding 
domain. The emaD nucleotide sequence is not homologous to known sequences of 
bacterial genes. The translated amino acid sequence, shares segmental homology with 

15 the same fimbria-associated proteins of Actinomyces as does EmaC. 

emaE 

The fifth ORF, emaE, begins 42 bp 3' of emaD and is in the same transcriptional 
orientation. It is 2712 bp long, with a predicted protein product of 904 aa and Mr 

20 100.9. FIGURE 6 depicts the nucleic acid sequence (SEQ ID NO: 9) and predicted 
amino acid sequence (SEQ ID NO: 10) of emaE, The predicted EmaE protein is a 
nonrepetitive protein. An obvious N-terminal signal peptide is not evident but a 
putative transmembrane region is located at residues 24-40. The mature protein has an 
imperfect cell wall binding domain (XPXTGG (SEQ ID NO: 21)) followed by a 

25 transmembrane spanning domain emcompassing residues 880 - 896. The emaE 

nucleotide sequence is not homologous to known sequences of bacterial genes. The 
translated amino acid sequence, however, shares segmental homology with a number 
of characterized proteins, including the Fl and F2 fibronectin binding proteins of S. 
pyogenes (31% homology over 207 amino acids) (18, 19). These proteins mediate 

30 high affinity binding to. fibronectin, and are important in the adhesion of S. pyogenes to 
respiratory cells. 
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The similarity of the protein products of the ema locus to physiologically important 
adhesins and invasins of other bacterial species suggests that the Ema proteins have a 
role in facilitating the adhesion of GBS to extracellular matrix components and to cell 
surfaces and subsequent invasion of epithelial and endothelial cells, the initial steps in 
5 the pathogenesis of infection. 

Several lines of evidence suggest the members of the ema and the spb locus may have 
similar functions, but are likely to represent distinct classes of proteins. The ema and 
spb locus genes are each and all similar to physiologically important adhesions and 

10 invasions of the bacterial species, however, both Spbl and Spb2 have prototypical 
gram positive cell-wall binding domains, whereas the members of the ema locus have 
an unusual motif, suggesting a distinct mechanism of cell surface anchoring. Second, 
the spb locus is restricted to virulent serotype III-3 strains of GBS, whereas the ema 
locus appears to be ubiquitous in all GBS serotypes. Third, spbl and spb2 are more 

15 homologous to one another than to members of the ema locus and ema genes are 
more closely homologous to one another than to spbl and spb2. 

EXAMPLE 2 

20 BIOLOGIC CHARACTERIZATION OF NOVEL GBS GENES 

Isogenic Mutant Bacterial Strains 

To identify biologic activity of these novel GBS genes, isogenic mutant bacterial 
25 strains are created which are identical in all respects except for the presence or absence 
of a particular gene. Deletion mutants are created by allelic replacement. The relevant 
gene, with 100-300 bp of flanking sequences, is subcloned and modified by the 
deletion of an intragenic portion of the coding sequence and, in some cases, the 
insertion of a kanamycin resistance gene. The mutant gene is cloned into the suicide 
30 vector pHY304 (kindly provided by Dr. Craig Rubens), a broad host range plasmid 
containing a temperature sensitive ori, erythromycin resistance gene (erm TS ), and a 
pBS multiple cloning site. The pHY304 vector is a derivative of the vector pWVOl 
(Framson, P.E. et al (1997) Applied Environ Microbiology 63:3539-3547). Plasmids 



WO 02/12294 



PCT/US01/24795 



99 

containing mutant genes are electroporated into strain 874391 and single cross-over 
mutants are selected by antibiotic resistance at 37°C. The resulting antibiotic resistant 
colonies are subjected to a temperature shift to 30°C. Integration of the plasmid is 
unstable at this permissive temperature because there are two functional ori's on the 
5 chromosome. Excised plasmid is eliminated by growth on nonselective media for 
many generations, then colonies are screened for the presence of the mutant allele by 
erythromycin-sensitivity. Double-crossover mutants are stable and do not require 
maintenance under drug selection. The mutant genotype is confirmed by Southern 
blotting or PCR demonstrating the appropriate deletion. The resulting mutants are 

10 screened for the presence of gene expression by Northern and Western blot analysis. 
The phenotype of the knockout mutants is then compared with that of the wild type 
strain 874391 by examining growth rate and colony morphology, and the expression of 
P -hemolysin and CAMP factor. Surface protein expression is assessed by Western 
blot, using polyclonal sera from rabbits immunized with whole, heat-killed type III 

15 GBS. 

c 

In Vitro Models 
A. Adherence 

20 Adhesion of GBS to host cells is a prerequisite for invasive disease. Three different 
cell types have the potential to be important in this process: i) adhesion to respiratory 
epithelial cells is likely to facilitate most early onset neonatal infections, ii) adhesion to 
gastrointestinal epithelial cells has been postulated to be important in the pathogenesis 
of late onset neonatal infections, and iii) adhesion to endothelial cells is necessary for 

25 both endocarditis and other endovascular infections, and is likely to be the initial event 
in GBS meningitis. The ability of wild type and mutant strains to adhere to epithelial 
and endothelial cells is compared in adhesion assays. 

Four different cell lines are used to investigate the role of novel GBS genes in 
30 adhesion. GBS adhere to and invade A549 human alveolar epithelial carcinoma cells 
and surface proteins appear to play an important role in this process (8). GBS binding 
to A549 cells is used as an in vitro model for respiratory colonization. GBS also 
adhere to C2BBeL, a human intestinal epithelial cell line, which is used as a model for 
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gastrointestinal colonization, and to HeLa cervical epithelial cells, a model for genital 
colonization and maternal infection. For endothelial adhesion, two cell lines are 
studied: freshly isolated human umbilical vein endothelial (HUVE) cells; and an 
immortalized human brain microvascular endothelial cell line (BMEC). Adhesion 
5 assays are performed as described by Tamura et al (9). Cell lines are grown to 
confluence in 96-well tissue culture plates in recommended media. Monolayers are 
washed with PBS and fixed with 0.5% gluteraldehyde. Following blocking with 5% 
BSA in PBS, cells are inoculated with various inocula of GBS, centrifuged for 10 
minutes at 2000 rpm and incubated for 1 hour at 4°C. Nonadherent bacteria are 
10 removed by washing three times with 5% nonfat dry milk in PBS and bound bacteria 
are then eluted and plated quantitatively. 



B. Invasion 

GBS adhere to and invade respiratory epithelium, endothelium and BMEC (8, 10, 1 1). 

15 The ability of wild type and isogenic mutant GBS strains to invade the above 

epithelial and endothelial cells are tested as previously described (8, 10, 11). Assays 
that distinguish the ability of GBS to invade eukaryotic cells versus adhere to cells 
capitalize on the inability of penicillin and gentamicin to enter host cells, allowing 
quantification of intracellular bacteria after extracellular bacteria are killed. GBS are 

20 grown to the desired growth phase in TH broth, washed twice with PBS and 

resuspended in tissue culture media containing 1 0% fetal calf serum. Tissue culture 
monolayers grown to confluence in 24-well plates are inoculated with varying inocula 
of GBS, centrifuged at 800xg and incubated at 37°C in 5% C0 2 for 2-6 hours. 
Extracellular bacteria are removed by washing four times with PBS. Cells are then 
25 incubated in fresh medium with 5 mg/ml penicillin and 100 mg/ml gentamicin for 2 
hours. Media is then removed, monolayers washed, and cells lysed by treatment with 
0.025% Triton X-100. Cell lysates are sonicated to disrupt bacterial chains and 
aliquots plated quantitatively. 

30 C. Antibody to GBS Proteins 

The ability of specific antibody to the novel GBS proteins to promote 
opsonophagocytic killing of GBS is tested (12). Rabbits are immunized with 
recombinant or purified GBS proteins produced by standard techniques. Rabbit 
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antiserum of different dilutions (ranging from 1/50 to 1/5,000) that has been 
exhaustively absorbed with the relevant isogenic mutant strain at 4°C will be incubated 
with GBS in the presence of human complement and polymorphonuclear leukocytes (3 
x 10 6 ). Opsonophagocytic killing is expressed as the log number of CFU surviving 
5 following 1 hour of incubation subtracted from the log of the number of CFU at the 
zero time point. Killing of wild type strains is compared to that of isogenic mutants 
lacking novel GBS proteins. 

In Vivo Models 

10 

The neonatal rat has been used by numerous laboratories as a model of GBS infection 
because it closely mimics human neonatal infection (13). The contribution of novel 
genes to the pathogenesis of GBS infections is tested by comparing wild type and 
mutant in this system. Rat pups are inoculated by two routes. First, pups are 
15 inoculated intranasally to mimic the respiratory infection and sepsis typical of early 
onset GBS infection. Secondly, intraperitoneal or subcutaneous inoculation 
reproduces the high grade bacteremia associated with GBS sepsis and that precedes 
GBS meningitis (14). 

20 Rat pups are inoculated with varying doses of GBS strains and mortality is determined. 
The level of bacteremia is determined by quantitative blood cultures. Lung, liver, 
spleen and meningeal tissue are preserved for histologic examination. 

The ability of antiserum to the GBS proteins to protect neonatal rats from GBS 
25 infection is tested (13). Newborn rats (<18 hours old) receive an intraperitoneal 
injection of 0.5 ml of undiluted rabbit antiserum, followed by the intraperitoneal 
inoculation of the equivalent of one LD50 unit of GBS (usually about 5000 bacteria) 
in PBS. Mortality and morbidity are then determined. 

30 Role of Novel GBS Proteins in Vaccines 



Several surface proteins of GBS, including C and Rib are immunogenic and protective 
against GBS infection in infant rodent models (25, 26). None of these proteins are 
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present in all GBS strains (27). Furthermore, each of these proteins has a repetitive 
structure. The phenotypic variability of these repetitive proteins allows escape 
mutants expressing variant forms to evade host immune systems and may limit the 
effectiveness of these vaccines (28). It is notable that each of the predicted proteins of 
5 the spb and ema loci do not have a repetitive structure and would not have this 
disadvantage. 



The novel GBS proteins we describe here may be useful antigens for a GBS vaccine. 
The data presented herein indicates these proteins have a role in mediating adhesion to 

10 and invasion of GBS to human epithelial cells, thus antibody against these antigens 
may prevent these initial steps in infection. It is highly desirable to develop a vaccifie 
that prevents colonization of pregnant women and other individuals at increased risk 
of invasive GBS infection, as this would eliminate most infections. Our data suggests 
that antibody against Spbl is effective in reducing colonization or infection following 

15 colonization with highly virulent strains of serotype III, and therefore this protein is a 
particularly useful vaccine antigen. Members of the ema locus, unlike spbl and spbl, 
are ubiquitous in GBS and therefore have a role in the prevention of infection by 
multiple serotypes of GBS. An optimal vaccine formulation includes combinations of 
these antigens. 

20 

Two strategies are used to design GBS vaccines using these novel proteins. First, 
purified recombinant or affinity-purified proteins are used as vaccine antigens, singly 
or in combination (25). Second, these proteins are used as carrier proteins for 
capsular polysaccharide or oligosaccharide-based vaccines. GBS polysaccharides and 

25 oligosaccharides are generally poorly immunogenic and fail to elicit significant memory 
and booster responses (29). Conjugation of these polysaccharides or oligosaccharides 
to protein carriers increases immunogenicity. GBS polysaccharide conjugated to 
tetanus toxoid, for example, has been used to immunize pregnant women and results in 
high levels of maternal serum anti-poly saccharide antibody which may be transferred 

30 to the fetus in the third trimester of pregnancy (30). Selection of appropriate carrier 
proteins is important for the development of polysaccharide-protein vaccine 
formulations. For example, Haemophilus influenzae type b poly- or oligosaccharide 
conjugated to different protein carriers has variable immunogenicity and elicits 
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antibody with varying avidity (3 1, 32). Repeated immunization with the same carrier 
protein may also suppress immune responses by competition for specific B cells 
(epitopic suppression) or other mechanisms. This is of particular concern for the 
development of GBS vaccines since recently developed polyaccharide and 
5 oligosaccharide-protein conjugate vaccines against the bacteria H, influenzae, S. 
pneumoniae, and N. meningitidis all utilize a restricted number of carrier proteins 
(tetanus toxoid, CRM 197, diptheria toxoid), increasing the number of exposures to 
these carriers an individual is likely to recieve. A "designer" vaccine, composed of a 
GBS polysaccharide or oligosaccharide coupled to a GBS-specific carrier protein, 
10 such as the novel GBS polypeptides, provided herein, particularly including Spbl, 
EmaC and EmaE, may be a preferable strategy. The large size of certain of these 
novel GBS antigens may also be an advantage to traditional carrier proteins as 
increasing size is associated with improved immunogenicity. 

15 EXAMPLE 3 

EMA HOMOLOGS IN STREPTOCOCCI AND OTHER BACTERIA 

As noted above, the GBS Ema proteins share segmental homology with certain 
characterized proteins from other bacterial species, including bacterial adhesion and 

20 invasion proteins. The segmental homolog is noted as in the range of 24-39%. In 
addition, the Ema proteins demonstrate some homology to one another. A 
comparison of the ema genes shows that EmaA and EmaB are 47% homologous, 
however, due to the difference in their predicted lengths it is necessary to insert gaps 
in the EmaA sequence in order to line them up. The two Ema proteins which are most 

25 similar in structure, EmaC and EmaD share 48.7% amino acid homology to one 

another. EmaA/B, EmaC/D and EmaE are each < 20% homologous to one another. 

The ema sequences were used to search the unannotated microbial genomes 
(Eubacteria). The predicted Ema proteins were searched against translations in all six 
30 frames (tblast x) of finished and unfinished unannotated microbial genomes available at 
the web site of the National Center for Biotechnology Information (NCBI). 
Segmental amino acid homolog was identified. 
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EmaA has some segmental homolog with S. pneumoniae, E. faecalis, B. anthracis and 
C. diptheriae. Ema B has some segmental homolog with B. anthracis. EmaE has 
segmental homology to S. pyogenes and lesser homology to B. anthracis. 

5 Significant homology was identified between the GBS EmaC and EmaD and proteins 
in other bacterial species. EmaC has significant (55% identity over 149 amino acids) 
homology to a region of the S. pneumoniae chromosome and the S. pyogenes 
chromosome (47% identity over 150 amino acids). Lesser segmental homology was 
found to E. faecalis, S. equi, and C. diptheriae. EmaD has strong segmental 
10 homology (66% over 184 amino acids) to a region of the S. pneumoniae chromosome, 
and lesser segmental homology to C diphtheriae and S. pyogenes. 

We have identified two Ema homologs in S. pneumoniae. These S. pneumoniae 
homologs show homology to EmaC and EmaD and, like EmaC and EmaD, also 

15 demonstrate homology to fimbria-associated protein of Actinomyces. The encoding 
nucleic acid and predicted amino acid sequence of the first S. pneumoniae EmaC/D 
homolog are provided in SEQ ID NOS: 24 and 23, respectively. The genome region 
nucleic acid including the first homolog encoding sequence is provided in SEQ ID NO: 
22. The nucleic acid and predicted amino acid sequence of the second S. pneumoniae 

20 EmaC/D homolog are provided in SEQ ID NOS: 27 and 26 respectively. The 
genomic region nucleic acid of this second homolog is found in SEQ ID NO: 25. 
An EmaC/D homolog has been identified in Enterococcus faecalis by search and 
analysis. The E. faecalis EmaC/D homolog predicted amino acid sequence is provided 
in SEQ ID NO: 29. The nucleic acid sequence encoding this E. faecalis Ema homolog 

25 is provided in SEQ ID NO: 30. The nucleic acid sequence of E. faecalis which 
genomic region encodes the EmaC/D homolog is provided in SEQ ID NO: 28. 

We have also identified an EmaD homolog in Corynebacterium diptheriae. The 
predicted amino acid sequence of the C. diptheriae EmaD homolog is provided in 
30 SEQ ID NO: 32. C. diptheriae nucleic acid sequence which encodes the homolog is 
found in SEQ ID NO: 33. The corresponding genomic region sequence of C. 
diptheriae is provided in SEQ ID NO: 31. 
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A predicted EmaC/D homolog has been identified in S. pyogenes. The predicted 
partial amino acid sequence of this Ema homolog provided in SEQ ED NO: 37. 

A region of amino acids TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34) is 
5 found in GBS EmaC, GBS EmaD, in both the EmaC/D homologs of & pneumoniae , 
and in the E. faecalis Ema homolog. A similar sequence 

TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35) is also found in the C. diptheriae 
Ema homolog. The & pyogenes predicted Ema homolog has a similar sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36) as well. 

10 
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This invention may be embodied in other forms or carried out in other ways without 
departing from the spirit or essential characteristics thereof. The present disclosure is 
20 therefore to be considered as in all aspects illustrate and not restrictive, the scope of 
the invention being indicated by the appended Claims, and all changes which come 
within the meaning and range of equivalency are intended to be embraced therein. 

Various references are cited throughout this Specification, each of which is 
25 incorporated herein by reference in its entirety. 
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WHAT IS CLAIMED IS : 

1 . An isolated streptococcal polypeptide EmaA. 

2. The EmaA polypeptide of Claim 1 which comprises the amino acid sequence 
set out in SEQ ID NO: 2, and analogs, variants and immunogenic fragments 
thereof. 

r 

3. An isolated streptococcal polypeptide EmaB. 

« 

4. The EmaC polypeptide of Claim 3 which comprises the amino acid sequence 
set out in SEQ ID NO: 4, and analogs, variants and immunogenic fragments 
thereof, 

5. An isolated streptococcal polypeptide EmaC. 

6. The EmaC polypeptide of Claim 5 which comprises the amino acid sequence 
set out in SEQ ID NO: 6, and analogs, variants and immunogenic fragments 
thereof. 

7. An isolated streptococcal polypeptide EmaD. 

8. The EmaD polypeptide of Claim 7 which comprises the amino acid sequence 
set out in SEQ ID NO: 8, and analogs, variants and immunogenic fragments 
thereof. 

9. An isolated streptococcal polypeptide EmaE. 

10. The EmaE polypeptide of Claim 9 which comprises the amino acid sequence 
set out in SEQ ID NO: 10, and analogs, variants and immunogenic fragments 
thereof. 
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i 

1 1 . The streptococcal polypeptide of any of Claims 1, 3, 5, 7 or 9 labeled with a 
detectable label 

12. A vaccine comprising one or more streptococcal polypeptides selected from 
the group of EmaA, EmaB, EmaC, EmaD and EmaE, and a pharmaceutically 
acceptable adjuvant. 

13. The vaccine of Claim 12, further comprising an antigen selected from the 
group consisting of: 

a. the polypeptide Spbl or an immunogenic fragment thereof; 

« 

b. the polypeptide Spb2 or an immunogenic fragment thereof; 

c. the polypeptide C protein alpha antigen or an immunogenic fragment 
thereof; 

d. the polypeptide Rib or an immunogenic fragment thereof; 

e. the polypeptide Lmb or an immunogenic fragment thereof; 

f. the polypeptide C5a-ase or an immunogenic fragment thereof; 

g. Group B streptococcal polysaccharides or oligosaccharides; and 

h. any combination of one or more of the foregoing. 

14. An immunogenic composition comprising one of more streptococcal 
polypeptides selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, and a pharmaceutically acceptable adjuvant. 

1 5. The immunogenic composition of Claim 14, further comprising an antigen 
selected from the group consisting of: 

a. the polypeptide Spbl of an immunogenic fragment thereof; 

b. the polypeptide Spb2 or an immunogenic fragment thereof; 

c. the polypeptide C protein alpha antigen or an immunogenic fragment 
thereof; 

d. the polypeptide Rib or an immunogenic fragment thereof; 

e. the polypeptide Lmb or an immunogenic fragment thereof; 

f. the polypeptide C5a-ase or an immunogenic fragment thereof; 

g. Group B streptococcal polysaccharides or oligosaccharides; and 
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h. any combination of one or more of the foregoing. 

16. A pharmaceutical composition comprising one or more streptococcal 
polypeptides selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, and a pharmaceutical^ acceptable carrier. 

17. The pharmaceutical composition of Claim 16, further comprising an active 
ingredient selected from the group consisting of: 

a. Spbl or Spb2 polypeptide; 

b. C protein alpha antigen; 

c. Rib polypeptide; 

d. Lmb polypeptide; 

e. CSa-ase polypeptide; 

£ . a Group B streptococcal polysaccharide or oligosaccharide; and 

g. an anti-streptococcal vaccine. 

18. A purified antibody to a streptococcal polypeptide selected from the group of 
EmaA, EmaB, EmaC, EmaD and EmaE. 

19. A monoclonal antibody to a streptococcal polypeptide selected from the group 
of EmaA, EmaB, EmaC, EmaD and EmaE. 

20. An immortal cell line that produces a monoclonal antibody according to Claim 
19. 

2 1 The antibody of any of Claims 1 9 or 20 labeled with a detectable label. 

22. The antibody of Claim 21 wherein the label is selected from the group 
consisting of an enzyme, a chemical which fluoresces, and a radioactive 
element. 
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A pharmaceutical composition comprising one or more antibodies to a 
streptococcal protein selected from the group of EmaA, EmaB, EmaC, EmaD 
and EmaE, and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a combination of at least two 
antibodies to streptococcal proteins and a pharmaceutically acceptable carrier, 
wherein at least one antibody to a protein selected from the group of EmaA, 
EmaB, EmaC, EmaD and EmaE, is combined with at least one antibody to a 
protein selected from the group of Spbl and Spb2, Rib, Lmb, C5a-ase and C 
protein alpha antigen. 

An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
1 , or a fragment thereof. 

The isolated nucleic acid of Claim 25, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO : 1 ; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f. hybridizable fragments thereof. 

An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 

3. 

The isolated nucleic acid of Claim 27, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO : 3 ; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 
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c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f. hybridizable fragments thereof 



29. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
5. 



30. The isolated nucleic acid of Claim 29, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 5; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f. hybridizable fragments thereof 



31. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
7. 



The isolated nucleic acid of Claim 3 1 , wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 7; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

» 

f. hybridizable fragments thereof 
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An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
9. 

The isolated nucleic acid of Claim 33, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 9; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 

« 

by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f hybridizable fragments thereof 

A vector which comprises the nucleic acid of any of Claims 25, 27, 29, 3 1 or 
33 and a promoter. 

The vector of Claim 35, wherein the promoter comprises a bacterial, yeast, 
insect or mammalian promoter. 

The vector of Claim 35, wherein the vector is a plasmid, cosmid, yeast artificial 
chromosome (YAC), bacteriophage or eukaryotic viral DNA. 

A host vector system for the production of a polypeptide which comprises the 
vector of Claim 35 in a suitable host cell. 

The host vector system of Claim 38, wherein the suitable host cell comprises a 
prokaryotic or eukaryotic cell 

The nucleic acid of any of Claims 25, 27, 29, 31 or 33 which is a recombinant 
DNA molecule. 
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41 . The recombinant DNA molecule of Claim 40, wherein the DNA molecule is 
operatively linked to an expression control sequence, 

42. A unicellular host transformed with a recombinant DNA molecule of Claim 40. 



43 . A nucleic acid vaccine comprising the recombinant DNA molecule of Claim 
40. 



44. A method for detecting the presence of a streptococcal polypeptide selected 
from the group of EmaA, EmaB, EmaC, EmaD and EmaE, wherein the 
streptococcal polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a streptococcal 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD 
and EmaE is suspected with an antibody to the said streptococcal 
polypeptide under conditions that allow binding of the streptococcal 
polypeptide to antibody to occur; and 

b. detecting whether binding has occurred between the streptococcal 
polypeptide from the sample and the antibody; 

wherein the detection of binding indicates the* presence or activity of the streptococcal 
polypeptide in the sample. 



45. A method for detecting the presence of a bacterium having a gene encoding a 
streptococcal polypeptide selected from the group of emaA, emaB, emaC, 
emaD and emaE, comprising: 

a. contacting a sample in which the presence or activity of the bacterium 
is suspected with an oligonucleotide which hybridizes to a 
streptococcal polypeptide gene selected from the group ofemaA, 
emaB, emaC, emaD and emaE, under conditions that allow specific 
hybridization of the oligonucleotide to the gene to occur; and 

b. detecting whether hybridization has occurred between the 
oligonucleotide and the gene; 

wherein the detection of hybridization indicates that presence or activity of the 
bacterium in the sample. 
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46. A method for preventing infection with a bacterium that expresses a 
streptococcal Ema polypeptide comprising administering an immunogenically 
effective dose of a vaccine of Claim 12 to a subject. 

47. A method for preventing infection with a bacterium that expresses a 
streptococcal Ema polypeptide comprising administering an immunogenically 
effective dose of the immunogenic composition of Claim 14 to a subject. 

48. A method for treating infection with a bacterium that expresses a streptococcal 
Ema polypeptide comprising administering a therapeutically effective dose of a 
pharmaceutical composition of Claim 16 to a subject. 

49. A method for treating infection with a bacterium that expresses a streptococcal 
Ema polypeptide comprising administering a therapeutically effective dose of a 
pharmaceutical composition of Claim 23 to a subject. 

50. A method of inducing an immune response in a subject which has been exposed 
to or infected with a streptococcal bacterium comprising administering to the 
subject an amount of the pharmaceutical composition of Claim 16, thereby 
inducing an immune response. 

51. A method for preventing infection by a streptococcal bacterium in a subject 
comprising administering to the subject an amount of a pharmaceutical 
composition of Claim 23 and a pharmaceutically acceptable carrier or diluent, 
thereby preventing infection by a streptococcal bacterium. 

52. An isolated streptococcal Ema polypeptide comprising the amino acid 
sequence set out in SEQ ID NO:23. 

53. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
52. 



* 
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5 54. The isolated nucleic acid of Claim 53, wherein the nucleic acid is selected from 

the group consisting of: 

a. the DNA sequence of SEQ ID NO: 24; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

10 c. DNA sequences capable of encoding the amino acid sequence encoded 

by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f. hybridizable fragments thereof. 



15 55. An isolated streptococcal Ema polypeptide comprising the amino acid 

sequence set out in SEQ ID NO:26. 



56. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
55. 



57. The isolated nucleic acid of Claim 56, wherein the nucleic acid is selected from 
20 the group consisting of: 

a. the DNA sequence of SEQ ID NO: 27; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
25 by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f. hybridizable fragments thereof. 



58. An isolated streptococcal Ema polypeptide comprising the amino acid 
30 sequence set out in SEQ ID NO:37. 



59. 



An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
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60. An enterococcal Ema polypeptide comprising the amino acid sequence set out 
in SEQ ID NO:29. 



35 61 . An isolated nucleic acid which encodes the enterococcal polypeptide of Claim 

60. 



62. The isolated nucleic acid of Claim 61, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 30; 

40 b. DNA sequences that hybridize to the sequence of subpart (a) under 

« 

moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d . degenerate variants thereof; 
45 e. alleles thereof; and 

f. hybridizable fragments thereof 



63 . An isolated Corynebacterium Ema polypeptide comprising the amino acid 
sequence set out in SEQ ID NO: 32. 



64. An isolated nucleic acid which encodes the Corynebacterium polypeptide of 
50 Claim 63. 



65. The isolated nucleic acid of Claim 64, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 33; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
55 moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

60 f hybridizable fragments thereof 
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66. An isolated bacterial polypeptide comprising the amino acid sequence 
TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34), wherein the polypeptide 
is not isolated from Actinomyces . 

67. An isolated streptococcal polypeptide comprising the amino acid sequence 



69. An isolated bacterial polypeptide comprising the amino acid sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36). 

70 70. An isolated streptococcal polypeptide comprising the amino acid sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36). 

7 1 . An isolated polypeptide having the amino acid sequence selected from the 
group of 

TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34), TLVTCTPYGINTHRLLVTA 
75 (SEQ ID NO: 35), and TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36)'. 



65 



TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34). 



68. 



An isolated bacterial polypeptide comprising the amino acid sequence 
TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35). 



* 
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Figure 1. RDP type III-3 specific probes. Dot 
blot hybridization of probe DY1-1 with genomic 
DNA isolated from type III GBS. 10 ug of genomic 
DNA from each of 62 type Hi GBS strains was 
transferred to nylon membrane. Radiolabeled 
probe 1 hybridized with DNA from all III-3 strains 
(rows A-D) including the original type IH-3 strain 
(well E-1). The probe failed to hybridize with DNA 
from IH-2 strains (F1- F10, G1-7) including the 
original strain used in the subtraction hybridization 
(well E 10) and ltl-1 strains (wells H1-3; cf. Figure 
3). The same pattern of hybridization was 
observed using probe DY1-11. 
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atg acc ctt gtt aaa aat caa gat get ctt gat aaa get act gca aat 
Met Thr Leu Val Lys Asn Gin Asp Ala Leu Asp Lys Ala Thr Ala Asn 
15 10 1S 

aca gat gat gcg gca ttt ttg gaa att cca gtt gca tea act att aat 
Thr Asp Asp Ala Ala Phe Leu Glu He Pro Val Ala Ser Thr He Asn 

20 25 30 

gaa aaa gca gtt tta gga aaa gca att gaa aat act ttt gaa ctt caa 
Glu Lys Ala Val Leu Gly Lys Ala lie Glu Asn Thr Phe Glu Leu Gin 
35 40 45 

tat gac cat act cct gat aaa get gac aat cca aaa cca tct aat cct 
Tyr Asp His Thr Pro Asp Lys Ala Asp Asn Pro Lys Pro Ser Asn Pro 
50 55 60 

cca aga aaa cca gaa gtt cat act ggt ggg aaa cga ttt gta aag aaa 
Pro Arg Lys Pro Glu Val His Thr Gly Gly Lys Arg Phe Val Lys Lys 
65 70 75^ 80 



n 
» 



gac tea aca gaa aca caa aca eta ggt ggt get gag ttt gat ttg ttg 
Asp Ser Thr Glu Thr Gin Thr Leu Gly Gly Ala Glu Phe Asp Leu Leu 

85 90 95 



145 



tta aaa gaa aca aaa gca cca gaa ggt tat gta ate cct gat aaa gaa 
Leu Lys Glu Thr Lys Ala Pro Glu Gly Tyr Val He Pro Asp Lys Glu 

165 170 175 

ate gag ttt aca gta tea caa aca tct tat aat aca aaa cca act gac 
He Glu Phe Thr Val Ser Gin Thr Ser Tyr Asn Thr Lys Pro Thr Asp 

180 185 190 

ate acg gtt gat agt get gat gca aca cct gat aca att aaa aac aac 
He Thr Val Asp Ser Ala Asp Ala Thr Pro Asp Thr He Lys Asn Asn 
X95 200 205 

aaa cgt cct tea ate cct aat act ggt ggt att ggt acg get ate ttt 
Lys Arg Pro Ser He Pro Asn Thr Gly Gly He Gly Thr Ala He Phe 
210 215 220 

gtc get ate ggt get gcg gtg atg get ttt get gtt aag ggg atg aag 
Val Ala He Gly Ala Ala Val Met Ala Phe Ala val Lys Gly Met Lys 
225 230 235 240 

cgt cgt aca aaa gat aac taa 
Arg Arg Thr Lys Asp Asn 

245 



48 



96 



144 



192 



240 



288 



384 



get tct gat ggg aca gca gta aaa tgg aca gat get ctt att aaa gcg 33$ 
Ala Ser Asp Gly Thr Ala Val Lys Trp Thr Asp Ala Leu He Lys Ala 

100 105 HO 

aat act aat aaa aac tat att get gga gaa get gtt act ggg caa cca 
Asn Thr Asn Lys Asn Tyr He Ala Gly Glu Ala Val Thr Gly Gin Pro 
115 120 125 

ate aaa ttg aaa tea cat aca gac ggt acg ttt gag att aaa ggt ttg 
He Lys Leu Lys Ser His Thr Asp Gly Thr Phe Glu He Lys Gly Leu 
130 135 140 

get tat gca gtt gat gcg aat gca gag ggt aca gca gta act tac aaa 
Ala Tyr Ala Val Asp Ala Asn Ala Glu Gly Thr Ala Val Thr Tyr Lys 

150 155 160 



432 
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atg aaa caa aca tta aaa ctt atg ttt tct ttt ctg ttg atg tta ggg 
Met Lys Gin Thr Leu Lys Leu Met Phe Ser Phe Leu Leu Met Leu Gly 
1 5 10 15 

act atg ttt gga att age caa act gtt tta gcg caa gaa act cat cag 
Thr Met Phe Gly He Ser Gin Thr Val Leu Ala Gin Glu Thr His Gin 

20 25 30 

ttg acg att gtt cat ctt gaa gca agg gat att gat cgt cca aat cca 
Leu Thr lie Val His Leu Glu Ala Arg Asp He Asp Arg Pro Asn Pro 
35 40 45 

cag ttg gag att gec cct aaa gaa ggg act cca att gaa gga gta etc 
Gin Leu Glu He Ala Pro Lys Glu Gly Thr Pro He Glu Gly Val Leu 
50 55 60 

tat cag ttg tac caa tta aaa tea act gaa gat ggc gat ttg ttg gca 
Tyr Gin Leu Tyr -Gin Leu Lys Ser Thr Glu Asp Gly Asp Leu Leu Ala 
65 70 75 80 

cat tgg aat tec eta act ate aca gaa ttg aaa aaa cag gcg cag cag 
His Trp Asn Ser Leu Thr He Thr Glu Leu Lys Lys Gin Ala Gin Gin 

85 90 95 

gtt ttt gaa gec act act aat caa caa gga*aag get aca ttt aac caa 
Val Phe Glu Ala Thr Thr Asn Gin Gin Gly Lys Ala Thr Phe Asn Gin 

100 105 HO 

eta cca gat gga att tat tat ggt ctg gcg gtt aaa gec ggt gaa aaa 
Leu Pro Asp Gly He Tyr Tyr Gly Leu Ala Val Lys Ala Gly Glu Lys 
115 120 125 

aat cgt aat gtc tea get ttc ttg gtt gac ttg tct gag gat aaa gtg 
Asn Arg Asn Val Ser Ala Phe Leu Val Asp Leu Ser Glu Asp Lys Val 
130 135 140 

att tat cct aaa ate ate tgg tec aca ggt gag ttg gac ttg ctt aaa 
He Tyr Pro Lys He He Trp Ser Thr Gly Glu Leu Asp Leu Leu Lys 
145 150 155 160 

gtt ggt gtg gat ggt gat ace aaa aaa cca eta gca ggc gtt gtc ttt 
Val Gly Val Asp Gly Asp Thr Lys Lys Pro Leu Ala Gly Val Val Phe 

165 170 175 

gaa ctt tat gaa aag aat ggt agg act cct att cgt gtg aaa aat ggg 
Glu Leu Tyr Glu Lys Asn Gly Arg Thr Pro He Arg Val Lys Asn Gly 

180 185 190 

gtg cat tct caa gat att gac get gca aaa cat tta gaa aca gat tea 
Val His Ser Gin Asp lie Asp Ala Ala Lys His Leu Glu Thr Asp Ser 
195 200 205 

tea ggg cat ate aga att tec ggg etc ate cat ggg gac tat gtc tta 
Ser Gly His He Arg He Ser Gly Leu He His Gly Asp Tyr Val Leu 
210 215 220 

aaa gaa ate gag aca cag tea gga tat cag ate gga cag gca gag act 
Lys Glu He Glu Thr Gin Ser Gly Tyr Gin He Gly Gin Ala Glu Thr 
225 230 235 240 

get gtg act att gaa aaa tea aaa aca gta aca gta acg att gaa aat 
Ala Val Thr He Glu Lys Ser Lys Thr Val Thr Val Thr He Glu Asn 

245 250 255 

aaa aaa gtt ccg aca cct aaa gtg cca tct cga gga ggt ctt att ccc 
Lys Lys Val Pro Thr Pro Lys Val Pro Ser Arg Gly Gly Leu He Pro 

260 265 270 
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aaa aca ggt gag caa cag 
Lys Thr Gly Glu Gin Gin 
275 

tta att get tta gec tta 
Leu He Ala Leu Ala Leu 
290 

aat aag gat tag 
Asn Lys Asp 
305 



gca atg gca ctt gta att 
Ala Met Ala Leu Val He 
260 

cga tta eta tea aaa cat 
Arg Leu Leu Ser Lys His 
295 300 



att ggt ggt att 864 

He Gly Gly He 

285 

egg aaa cat caa 912 
Arg Lys His Gin 



924 
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ata aaa caa aaa tea aaa ata tct eta get acg aat att cgt ata tgg 

Met Gly Gin Lys Ser Lys lie Ser Leu Ala Thr Aan lie Arg lie Trp 

^ 5 1 

att ttt cgt tta att ttc tta gcg ggt ttc ctt gtt ttg gca ttt ccc 

lie Phe Arg Leu lie Phe Leu Ala Gly Phe Leu Val Leu Ala Phe Pro 

20 25 30 

ate att agt cag gtc atg tac ttt caa gec tct cac gec aat att aat 

lit vll Ser Gin Val Met Tyr Phe Gin Ala Ser His Ala Asn He Asn 

35 40 45 

act ttt aaa gaa get gtt acc aag att gac egg gtg gag att aat egg 

III Phe Lys Glu Ala Val Thr Lys lie Asp Arg Val Glu lie Asn Arg 

cat tta gaa ctt get tat get tat aac gec agt ata gca ggt gec aaa 

Arg Leu Glu Leu Ala Tyr Ala Tyr Asn Ala Ser He Ala Gly Ala Lys 

€5 70 75 

act aat ggc gaa tat cca gcg ctt aaa gac ccc tac tct get gaa caa 

?£r Asn Gly Glu Tyr Pro Ala Leu Lys Asp Pro Tyr Ser Ala Glu Gin 

aaa cag gca ggg gtc gtt gag tac gec cgc atg et,t gaa gtc aaa gaa 

Lys Gin Ala Gly Val Val Glu Tyr Ala Arg Met L*i Glu Val Lys Glu 

100 105 110 

r-aA ata aat cat gtg att att cca aga att aat cag gat ate cct att 

Gin lie Gly His Val He lie Pro Arg He As* Gin Asp He Pro He 

115 120 125 

tac act age tct get gaa gaa aat ctt cag agg ggc gtt gga cat tta 

£vr Ala Gly Ser Ala Glu Glu Asn Leu Gin Arg Gly Val Gly Hxs Leu 



130 



aaa aaa acc agt ctt cca gtc ggt ggt gag tea act cat gec gtt eta 
Glu III Thr Ser Leu Pro Val Gly Gly Glu Ser Thr His Ala Val Leu 

150 155 180 



145 



act acc cat cga ggg eta cca acg gec aag eta ttt acc aat tta gac 
?hr fit Til Arg Gly Leu Pro Thr Ala Lys Leu Phe Thr Asn Leu As P 

165 I 70 175 

aaa ata aca gta ggt gac cgt ttt tac att gaa cac ate ggc gga aag 
Ly! vll xhr vtl liy Asp Arg Phe Tyr He Glu His He Gly Gly Ly S 
Y 180 185 190 

att act tat cag gta gac caa ate aaa gtt ate gee cct gat cag tta 
He Ala Tyr Gin val Asp Gin He. Lys Val He Ala Pro Asp Gin Leu 
19 5 200 205 

aaq qat ttg tac gtg att caa gga gaa gat cac gtc acc eta tta act 
Glu Asp Leu Tyr Val He Gin Gly Glu Asp His Val Thr Leu Leu Thr 

215 220 



tgc aea cct tat atg . ata aat agt eat cgc etc etc gtt cga ggc aag 

Cys Thr Pro Tyr Met He Asn Ser His Arg Leu Leu Val Arg Gly Lys 

225 230 235 

cga att cct tat gtg gaa aaa aca gtg cag aaa gat tea aag acc ttc 

Ara He Pro Tyr Val Glu Lys Thr Val Gin Lys Asp Ser Lys Thr Phe 

245 250 255 

agg caa caa caa tac eta acc tat get atg tgg gta gtc gtt gga ctt 

A?g Gin Gin Gin Tyr Leu Thr Tyr Ala Met Trp Val Val val Gly Leu 

260 265 270 

ate ttg ctg teg ctt etc att tgg ttt aaa aag acg aaa cag aaa aag 
He Leu Leu Ser Leu Leu He Ttp Phe Lys Lys Thr Lys Gin Lys Lys 

275 280 285 

egg aga aag aat gaa aaa gcg get agt caa aat agt cac aat aat teg 

Arg Arg Lys Asn Glu Lys Ala ALa Ser Gin Asn Ser His Asn Asn Ser 

290 295 300 



aaa taa 

Lys 

305 
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EmaO 

atg aaa aag egg eta gtc aaa ata gtc aca ata att cga aat aat aaa 
Met Lys Lys Arg Leu Val Lys lie Val Thr lie lie Arg Asn Asn Lys 
1 5 10 15 



caa ttt aag egg gaa gtc get aag att gat act aat acg gtt gaa cga 
Gin Phe Lys Arg Glu Val Ala Lys lie Asp Thr Asn Thr Val Glu Arg 
50 55 60 



ttg ctt ata gac cct ttt acc agt aag caa aaa gaa ggt ttg aga gag 
Leu Leu lie Asp Pro Phe Thr Ser Lys Gin LyS Glu Gly Leu Arg Glu 

85 90 95 



ata atg atg aga aga tgg atg caa cat cgt caa taa 
He Met Met Arg Arg Trp Met Gin His Arg Gin 



48 



ate aga acc etc att ttt gtg atg gga agt ctg att etc tta ttt ccg 96 
lie Arg Thr Leu He Phe Val Met Gly Ser Leu He Leu Leu Phe Pro 

20 25 30 

att gtg age cag gta agt tac tac ctt get teg cat caa aat att aat 144 
He Val Ser Gin Val Ser Tyr Tyr Leu Ala Ser His Gin Asn He Asn 
35 40 45 



192 



cgc ate get tta get aat get tac aat gag acg tta tea agg aat ccc 240 
Arg He Ala Leu Ala Asn Ala Tyr Asn Glu Thr Leu Ser Arg Asn Pro 
65 70 75 80 



288 



tat get cgt atg ctt gaa gtt cat gag caa ata ggt cat gtg gca ate 336 
Tyr Ala Arg Met Leu Glu Val His Glu Gin lie Gly His Val Ala He 

100 105 110 

cca agt at.t ggg gtt gat att cca att tat get gga aca tec gaa act ' 384 
Pro Ser He Gly Val Asp He Pro He Tyr Ala Gly Thr Ser Glu Thr 
115 120 125 

gtg ctt cag aaa ggt agt ggg cat ttg gag gga acc agt ctt cca gtg 432 
Val Leu Gin Lys Gly Ser Gly His Leu Glu Gly Thr Ser Leu Pro Val 
130 135 140 

gga ggt ttg tea acc cat tea gta eta act gec cac cgt ggc ttg cca 480 
Gly Gly Leu Ser Thr His Ser Val Leu Thr Ala His Arg Gly Leu Pro 
145 150 155 160 

aca get agg eta ttt acc gac tta aat aaa gtt aaa aaa ggc cag att 528 
Thr Ala Arg Leu Phe Thr Asp Leu Asn Lys Val Lys Lys Gly Gin He 

165 170 175 

ttc tat gtg acg aac ate aag gaa aca ctt gee tac aaa gtc gtg tct 57 6 
Phe Tyr Val Thr Asn He Lys Glu Thr Leu Ala Tyr Lys Val Val Ser 

180 185 190 

ate aaa gtt gtg gat cca aca get tta agt gag gtt aag att gtc aat 624 
He Lys Val Val Asp Pro Thr Ala Leu Ser Glu Val Lys He Val Asn 
195 200 205 

ggt aag gat tat ata acc ttg ctg act tgc aca cct tac atg ate aat 672 
Gly Lys Asp Tyr He Thr Leu Leu Thr Cys Thr Pro Tyr Met He Asn 
210 215 220 

agt cat cgt etc ttg gta aaa gga gag cgt att cct tat gat tct acc 720 
Ser His Arg Leu Leu Val Lys Gly Glu Arg He Pro Tyr Asp Ser Thr 
225 230 235 240 

gag gcg gaa aag cac aaa gaa caa acc gta caa gat tat cgt ttg tea 768 
Glu Ala Glu Lys His Lys Glu Gin Thr Val Gin Asp Tyr Arg Leu Ser 

245 250 255 

eta gtg ttg aag ata eta eta gta tta tta att gga etc ttc ate gtg 816 
Leu Val Leu Lys He Leu Leu Val Leu Leu He Gly Leu Phe He Val 

260 265 270 



852 
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EmaE 

atg atg att gtg aat aat ggt tat eta gaa ggg aga aaa atg aaa aag 
Met Met lie Val Asn Asn Gly Tyr Leu Glu Gly Arg Lys Met Lys Lys 
1 5 10 15 



caa att cca ttt ggt ata ttg gta caa ggt gaa acc caa gat acc aat 

Gin lie Pro Phe Gly lie Leu Val Gin Gly Glu Thr Gin Asp Thr Asn 

35 40 45 

caa gca ctt gga aaa gta att gtt aaa aaa acg gga gac aat get aca 

Gin Ala Leu Gly Lys Val lie Val Lys Lys Thr Gly Asp Asn. Ala Thr 

50 55 60 

cca tta ggc aaa gcg act ttt gtg tta aaa aat gac aat gat aag tea 

Pro Leu Gly Lys Ala Thr Phe Val Leu Lys Asn Asp Asn Asp Lys Ser 

65 70 75 80 

gaa aca agt cac gaa acg gta gag ggt tct gga gaa gca acc ttt gaa 

Glu Thr Ser His Glu Thr Val Glu Gly Ser Gly^GJ-U Ala Thr Phe Glu 

85 90 95 

aac ata aaa cct gga gac tac aca tta aga gaa gaa aca gca cca att 

Asn lie Lys Pro Gly Asp Tyr Thr Leu Arg Glu Glu Thr Ala Pro He 

100 105 110 

ggt tat aaa aaa act gat aaa acc tgg aaa gtt aaa gtt gca gat aac 

Gly Tyr Lys Lys Thr Asp Lys Thr Trp Lys Val Lys Val Ala Asp Asn 

115 ' 120 125 

gga gca aca ata ate gag ggt atg gat gca gat aaa gca gag aaa cga 

Gly Ala Thr He He Glu Gly Met Asp Ala Asp Lys Ala Glu Ly3 Arg 

130 135 140 

aaa gaa gtt ttg aat gec caa tat cca aaa tea get att tat gag gat 

Lys Glu Val Leu Asn Ala Gin Tyr Pro Lys Ser Ala He Tyr Glu Asp 

145 150 155 160 

aca aaa gaa aat tac cca tta gtt aat gta gag ggt. tec aaa gtt ggt 

Thr Lys Glu Asn Tyr Pro Leu Val Asn Val Glu Gly Ser Lys Val Gly 

165 170 175 

gaa caa tac aaa gca ttg aat cca ata aat gga aaa gat ggt cga aga 

Glu Gin Tyr Lys Ala Leu Asn Pro He Asn Gly Lys Asp Gly Arg Arg 

180 185 190 

gag att get gaa ggt tgg tta tea aaa aaa aat aca ggg gtc aat gat 

Glu He Ala Glu Gly Trp Leu Ser Lys Lys Asn Thr Gly Val Asn Asp 

195 200 205 

etc gat aag aat aaa tat aaa att gaa tta act gtt gag ggt aaa acc 

Leu Asp Lys Asn Lys Tyr Lys He Glu Leu Thr Val Glu Gly Lys Thr 

210 215 220 

act gtt gaa acg aaa gaa ctt aat caa cca eta gat gtc gtt gtg eta 

Thr Val Glu Thr Lys Glu Leu Asn Gin Pro Leu Asp Val Val Val Leu 

225 230 235 240 

tta gat aat tea aat agt atg aat aat- gaa aga gec aat aat tct caa 

Leu Asp Asn Ser Asn Ser Met Asn Asn Glu Arg Ala Asn Asn Ser Gin 

245 250 255 

aga gca tta aaa get ggg gaa gca gtt gaa aag ctg att gat aaa att 

Arg Ala Leu Lys Ala Gly Glu Ala Val Glu Lys Leu He Asp Lys He 

2 60 265 270 

aca tea aat aaa gac aat aga gta get ctt gtg aca tat gec tea acc 

Thr Ser Asn Lys Asp Asn Arg Val Ala Leu Val Thr Tyr Ala Ser Thr 



48 



aga caa aaa ata tgg aga ggg tta tea gtt act tta eta ate ctg tec 96 
Arg Gin Lys He Trp Arg Gly Leu Ser Val Thr Leu Leu He Leu Ser 

20 25 30 



144 



192 



240 



293 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 
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275 280 28-5- 

att ttt gat ggt act gaa gcg acc gta tea aag gga gtt gec gat caa 

lie Phe Asp Gly Thr Glu Ala Thr Val Ser Lys Gly Val Ala Asp Gin 
290 295 300 

aat ggt aaa gcg ctg aat gat agt gta tea tgg gat tat cat aaa act 

Asn Gly Lys Ala Leu Asn Asp Ser Val Ser Trp Asp Tyr His Lys Thr 
305 * 310 315 320 

act ttt aca gca act aca cat aat tac agt tat tta aat tta aca aat 

Thr Phe Thr Ala Thr Thr His Asn Tyr Ser Tyr Leu Asn Leu Thr Asn 

325 330 335 

gat get aac gaa gtt aat att eta aag tea aga att cca aag gaa gcg 

Asp Ala Asn Glu Val Asn He Leu Lys Ser Arg He Pro Lys Glu Ala 

340 345 350 

gag cat ata aat ggg gat cgc acg etc tat caa ttt ggt gcg aca ttt 

Glu His He Asn Gly Asp Arg Thr Leu Tyr Gin Phe Gly Ala Thr Phe 

355 360 365 



aca aag aaa gtt tct gca acg aaa caa ate aaa act cat ggt gag cca 

Thr Lys Lys val Ser Ala Thr Lys Gin He Lys Thr His Gly Glu Pro 
515 520 S25 

aca aca tta tac ttt aat gga aat ata aga cct aaa ggt tat gac att 

Thr Thr Leu Tyr Phe Asn Gly Asn lie Arg Pro Lys Gly Tyr Asp He 

530 535 540 

ttt act gtt ggg att ggt gta aac gga gat cct ggt gca act cct ctt 

Phe Thr Val Gly He Gly Val Asn Gly Asp Pro Gly Ala Thr 2ro Leu 

545 550 555 560 

gaa get gag aaa ttt atg caa tea ata tea agt aaa aca gaa aat tat 



912 



960 



1008 



1056 



1104 



1200 



1248 



1296 



act caa aaa get eta atg aaa gca aat gaa att, ,tta gag aca caa agt 1152 

Thr Gin Lys Ala Leu Met Lys Ala Asn Glu He Leu Glu Thr Gin Ser 

370 375 380 

tct aat get aga aaa aaa ctt att ttt cac gta act gat ggt gtc cct 

Ser Asn Ala Arg Lys Lys Leu lie Phe His val Thr Asp Gly Val Pro 

385 390 395 400 

acg atg tct tat gec ata aat ttt aat cct tat ata tea aca tct. tac 

Thr Met Ser Tyr Ala He Asn Phe Asn Pro Tyr He Ser Thr Ser Tyr 

405 410 415 

caa aac cag ttt aat tct ttt tta aat aaa ata cca gat aga agt ggt 

Gin Asn Gin Phe Asn Ser Phe Leu Asn Lys He Pro Asp Arg Ser Gly 

420 425 430 

att etc caa gag gat ttt ata ate aat ggt gat gat tat caa ata gta 

He Leu Gin Glu Asp Phe He He Asn Gly Asp Asp Tyr Gin He Val 

435 440 445 

aaa gga gat gga gag agt ttt aaa ctg ttt teg gat aga aaa gtt cct 

Lys Gly Asp Gly Glu Ser Phe Lys Leu Phe Ser Asp Arg Lys Val Pro 

450 455 460 

gtt act gga gga acg aca caa gca get tat cga gta ccg caa aat caa 

Val Thr Gly Gly Thr Thr Gin Ala Ala Tyr Arg Val Pro Gin Asn Gin 

465 470 475 480 

etc tct gta atg agt aat gag gga tat gca att aat agt gga tat att 

Leu Ser Val Met Ser Asn Glu Gly Tyr Ala He Asn Ser Gly Tyr He 

485 490 495 



1344 



1392 



1440 



1488 



tat etc tat tgg aga gat tac aac tgg gtc tat cca ttt gat cct aag 1536 
Tyr Leu Tyr Trp Arg Asp Tyr Asn Trp Val Tyr Pro Phe Asp Pro Lys 

500 505 510 



1584 



1632 



1680 



1728 
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Glu Ala Glu Lys Phe Met Gin Ser lie Ser Ser Lys Thr' Glu Asn Tyr 

565 570 575 

act aat gtt gat gat aca aat aaa att tat gat gag eta aat aaa tac 
Thr Asn Val Asp Asp Thr Asn Lys He Tyr Asp Glu Leu Asn Lys Tyr 

580 585 590 



agt ttt aca cat gat gat tac gtt ttg gtt gga aat gat ggc agt caa 

Ser Phe Thr His Asp Asp Tyr Val Leu Val Gly Asn Asp Gly Ser Gin 

625 630 635 640 

tta aaa aat ggt gtg get ctt ggt gga cca aac agt gat ggg gga att 

Leu Lys Asn Gly Val Ala Leu Gly Gly Pro Asn Ser Asp Gly Gly He 

645 650 655 

,« « • 

tta aaa gat gtt aca gtg act tat gat aag aca tct caa acc ate aaa 

Leu Lys Asp Val Thr Val Thr Tyr Asp Lys Thr Ser Gin Thr He Lys 

660 665 610 

ate aat cat ttg aac tta gga agt gga caa aaa gta gtt ctt acc tat 

He Asn His Leu Asn Leu Gly Ser Gly Gin Lys Val Val Leu The Tyr 

675 660 685 

gat gta cgt tta aaa gat aac tat ata agt aac aaa ttt tac aat aca 

Asp Val Arg Leu Lys Asp Asn Tyr He Ser Asn Lys Phe Tyr Asn Thr 
690 695 700 

aat aat cgt aca acg eta agt ccg aag agt gaa aaa gaa cca aat act 

Asn Asn Arg Thr Thr Leu Ser Pro Lys Ser Glu Lys Glu Pro Asn Thr 

705 710 71S 720 

att cgt gat ttc cca att ccc aaa att cgt gat gtt cgt gag ttt ccg 

He Arg Asp Phe Pro He Pro Lys lie Arg Asp Val Arg Glu Phe Pro 

725 730 735 



aaa gtt aat aaa gac aaa cat tea gaa teg ctt ttg gga get aag ttt 

Lys Val Asn Lys Asp Lys His Ser Glu Ser Leu Leu Gly Ala Lys Phe 

755 760 765 

caa ctt cag ata gaa aaa gat ttt tct ggg tat aag caa ttt gtt cca 

Gin Leu Gin He Glu Lys Asp Phe Ser Gly Tyr Lys Gin Phe Val Pro 
770 775 780 

gag gga agt gat gtt aca aca aag aat gat ggt aaa att tat ttt aaa 

Glu Gly Ser Asp Val Thr Thr Lys Asn Asp Gly Lys He Tyr Phe Lys 

7 85 790 795 800 

gca ctt caa gat ggt aac tat aaa tta tat gaa att tea agt cca gat 

Ala Leu Gin Asp Gly Asn Tyr Lys Leu Tyr Glu He Ser Ser Pro Asp 

805 810 815 

ggc tat ata gag gtt aaa acg aaa cct gtt gtg aca ttt aca att caa 

Gly Tyr He Glu Val Lys Thr Lys Pro Val Val Thr Phe Thr He Gin 

820 825 830 



1776 



ttt aaa aca att gtt gag gaa aaa cat tct att gtt gat gga aat gtg 1824 
Phe Lys Thr He Val Glu Glu Lys His Ser He Val Asp Gly Asn Val 
595 600 605 

act gat cct atg gga gag atg att gaa ttc caa tta aaa aat ggt caa 1872 
Thr Asp Pro Met Gly Glu Met He- Glu Phe Gin Leu Lys Asn Gly Gin 
610 615 620 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



gta eta acc ate agt aat cag aag aaa atg ggt gag gtt gaa ttt att 2256 
Val Leu Thr He Ser Asn Gin Lys Lys Met Gly Glu Val Glu Phe He 

740 745 750 



2304 



2352 



2400 



2448 



2496 



aat gga gaa gtt acg aac ctg aaa gca gat cca aat get aat aaa aat 2544 
Asn Gly Glu Val Thr Asn Leu Lys Ala Asp Pro Asn Ala Asn Lys Asn 
835 840 845 
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caa ate ggg tat ctt gaa gga aat ggt 
Gin lie Gly Tyr Leu Glu Gly Asn Gly 
850 855 

ccc aaa cgc cca cca ggt gtt ttt cct 
Pro Lys Arg Pro Pro Gly Val Phe Pro 
865 870 

att gtc tat ata tta gtt ggt tct act 
lie Val Tyr lie Leu Val Gly Ser Thr 

885 

tct ttc cgt cgt aaa caa ttg taa 
Ser Phe Arg Arg Lys Gin Leu 

900 



aaa cat ctt att acc aac act 2592 
Lys His Leu rle Thr Asn Thr 
860 

aaa aca ggg gga att ggt aca 2640 
Lys Thr Gly Gly lie Gly Thr 
875 880 

ttt atg ata ctt acc att tgt 2688 
Phe Met He Leu Thr He Cys 
B90 895 

2712 
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SEQUENCE LISTING 

<110> Adderson, Elisabeth 
Bohnsack, John 

<120> GROUP B STREPTOCOCCUS POYPEPTIDES NUCLEIC ACIDS AND 
THERAPEUTIC COMPOSITIONS AND VACCINES THEREOF 

<130> 2511-1-001 

<14 0> UNKNOWN 
<141> 2000-08-08 

<160> 37 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 737 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 1 

atgacccttg ttaaaaatca agatgctctt gataaagcta ctgcaaatac agatgatgcg 60 

gcatttttgg aaattccagt tgcatcaact attaatgaaa aagcagtttt aggaaaagca 120 

attgaaaata cttttgaact tcaatatgac catactcctg ataaagctga caatccaaaa 180 

ccatctaatc ctccaagaaa accagaagtt catactggtg ggaaacgatt tgtaaagaaa 240 

gactcaacag aaacacaaac actaggtggt gctgagtttg atttgttggc ttctgatggg 300 

acagcagtaa aatggacaga tgctcttatt aaagcgaata ctaataaaaa ctatattgct 360 

ggagaagctg ttactgggca accaatcaaa ttgaaatcac atacagacgg tacgtttgag 420 

attaaaggtt tggcttatgc agttgatgcg aatgcagagg gtacagcagt aacttacaaa 480 

ttaaaagaaa caaaagcacc agaaggttat gtaatccctg ataaagaaat cgagtttaca 540 

gtatcacaaa catcttataa tacaaaacca actgacatca cggttgatag tgctgatgca 600 

acacctgata caattaaaaa caacaaacgt ccttcaatcc ctaatactgg tggtattggt 660 

acggctatct ttgtcgctat cggtgctgcg gtgatggctt ttgctgttaa ggggatgaag 720 
cgtcgtacaa aagataa 737 
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<210> 2 
<211> 245 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 2 

Met Thr Leu Val Lys Asn Gin Asp Ala Leu Asp Lys Ala Thr Ala Asn 
15 10 15 

Thr Asp Asp Ala Ala Phe Leu Glu lie Pro Val Ala Ser Thr lie Asn 

20 25 30 

Glu Lys Ala Val Leu Gly Lys Ala lie Glu Asn Thr Phe Glu Leu Gin 

35 40 45 

Tyr Asp His Thr Pro Asp Lys Ala Asp Asn Pro Lys Pro Ser Asn Pro 
50 55 ' 60 

Pro Arg Lys Pro Glu Val His Thr Gly Gly Lys Arg Phe Val Lys Lys 
65 70 75 80 

Asp Ser Thr Glu Thr Gin Thr Leu Gly Gly Ala Glu Phe Asp Leu Leu 

85 90 95 

Ala Ser Asp Gly Thr Ala Val Lys Trp Thr Asp Ala Leu lie Lys Ala 

100 105 110 

Asn Thr Asn Lys Asn Tyr lie Ala Gly Glu Ala Val Thr Gly Gin Pro 
115 120 125 

He Lys Leu Lys Ser His Thr Asp Gly Thr Phe Glu He Lys Gly Leu 
130 135 140 

Ala Tyr Ala Val Asp Ala Asn Ala Glu Gly Thr Ala Val Thr Tyr Lys 
145 150 ' 155 160 

Leu Lys Glu Thr Lys Ala Pro Glu Gly Tyr Val lie Pro Asp Lys Glu 

165 170 175 

lie Glu Phe Thr Val Ser Gin Thr Ser Tyr Asn Thr Lys Pro Thr Asp 

180 185 190 

He Thr Val Asp Ser Ala Asp Ala Thr Pro Asp Thr He Lys Asn Asn 
195 200 205 



Lys Arg Pro Ser He Pro Asn Thr Gly Gly He Gly Thr Ala He Phe 
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210 215 220 

Val Ala lie Gly Ala Ala Val Met Ala Phe Ala Val Lys Gly Met Lys 

225 230 235 240 

Arg Arg Thr Lys Asp 

245 



<210> 3 
<211> 924 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 3 

atgaaacaaa cattaaaact tatgttttct tttctgttga tgttagggac tatgtttgga 60 
attagccaaa ctgttttagc gcaagaaact catcagttga cgattgttca tcttgaagca 120 
agggatattg atcgtccaaa tccacagttg gagattgccc ctaaagaagg gactccaatt 180 
gaaggagtac tctatcagtt gtaccaatta aaatcaactg aagatggcga tttgttggca 240 
cattggaatt ccctaactat cacagaattg aaaaaacagg cgcagcaggt ttttgaagcc 300 
actactaatc aacaaggaaa ggctacattt aaccaactac cagatggaat ttattatggt 360 
ctggcggtta aagccggtga aaaaaatcgt aatgtctcag ctttcttggt tgacttgtct 420 
gaggataaag tgatttatcc taaaatcatc tggtccacag gtgagttgga cttgcttaaa 480 
gttggtgtgg atggtgatac caaaaaacca ctagcaggcg ttgtctttga actttatgaa 540 
aagaatggta ggactcctat tcgtgtgaaa aatggggtgc attctcaaga tattgacgct 600 

» 

gcaaaacatt tagaaacaga ttcatcaggg catatcagaa tttccgggct catccatggg 660 
gactatgtct taaaagaaat cgagacacag tcaggatatc agatcggaca ggcagagact 720 
gctgtgacta ttgaaaaatc aaaaacagta acagtaacga ttgaaaataa aaaagttccg 780 
acacctaaag tgccatctcg aggaggtctt attcccaaaa caggtgagca acaggcaatg 840 
gcacttgtaa ttattggtgg tattttaatt gctttagcct tacgattact atcaaaacat 900 
cggaaacatc aaaataagga ttag 924 



3 
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<210> 4 
<211> 307 
<212> PRT 

<213> Streptococcus agalactiae 

<400> 4 

Met Lys Gin Thr Leu Lys Leu Met Phe Ser Phe Leu Leu Met Leu Gly 
15 10 15 

Thr Met Phe Gly lie Ser Gin Thr Val Leu Ala Gin Glu Thr His Gin 

20 25 30 

Leu Thr lie Val His Leu Glu Ala Arg Asp lie Asp Arg Pro Asn Pro 

35 40 45 

Gin Leu Glu lie Ala Pro Lys Glu Gly Thr Pro lie Glu Gly Val Leu 
50 55 60 

Tyr Gin Leu Tyr Gin Leu Lys Ser Thr Glu Asp Gly Asp Leu Leu Ala 
65 ' 70 75 80 J 

His Trp Asn Ser Leu Thr lie Thr Glu Leu Lys Lys Gin Ala Gin Gin 

85 90 95 

Val Phe Glu Ala Thr Thr Asn Gin Gin Gly Lys Ala Thr Phe Asn Gin 

100 105 110 

Leu Pro Asp Gly lie Tyr Tyr Gly Leu Ala Val Lys Ala Gly Glu Lys 
115 120 125 

Asn Arg Asn Val Ser Ala Phe Leu Val Asp Leu Ser Glu Asp Lys Val 
130 135 140 

lie Tyr Pro Lys lie lie Trp Ser Thr Gly Glu Leu Asp Leu Leu Lys 
145 150 155 160 

Val Gly Val Asp Gly Asp Thr Lys Lys Pro Leu Ala Gly Val Val Phe 

165 170 175 

Glu Leu Tyr Glu Lys Asn Gly Arg Thr Pro lie Arg Val Lys Asn Gly 

180 185 190 

Val His Ser Gin Asp lie Asp Ala Ala Lys His Leu Glu Thr Asp Ser 
195 200 205 

Ser Gly His lie Arg lie Ser Gly Leu lie His Gly Asp Tyr Val Leu 
210 215 220 
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Lys Glu He Glu Thr Gin Ser Gly Tyr Gin He Gly Gin Ala Glu Thr 
225 230 235 240 

Ala Val Thr He Glu Lys Ser Lys Thr Val Thr Val Thr He Glu Asn 

245 ' 250 255 

Lys Lys Val Pro Thr Pro Lys Val Pro Ser Arg Gly Gly Leu lie Pro 

260 265 270 

Lys Thr Gly Glu Gin Gin Ala Met Ala Leu Val He He Gly Gly He 
275 280 285 

Leu He Ala Leu Ala Leu Arg Leu Leu Ser Lys His Arg Lys His Gin 
290 295 300 

Asn Lys Asp 
305 



<210> 5 
<211> 918 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 5 

atgggacaaa aatcaaaaat atctctagct acgaatattc gtatatggat ttttcgttta 60 
attttcttag cgggtttcct tgttttggca tttcccatcg ttagtcaggt catgtacttt 120 
caagcctctc acgccaatat taatgctttt aaagaagctg ttaccaagat tgaccgggtg 180 
gagattaatc ggcgtttaga acttgcttat gcttataacg ccagtatagc aggtgccaaa 240 
actaatggcg aatatccagc gcttaaagac ccctactctg ctgaacaaaa gcaggcaggg 300 
gtcgttgagt acgcccgcat gcttgaagtc aaagaacaaa taggtcatgt gattattcca 360 
agaattaatc aggatatccc tatttacgct ggctctgctg aagaaaatct tcagaggggc 420 
gttggacatt tagaggggac cagtcttcca gtcggtggtg agtcaactca tgccgttcta 480 
actgcccatc gagggctacc aacggccaag ctatttacca atttagacaa ggtaacagta 54 0 
ggtgaccgtt tttacattga acacatcggc ggaaagattg cttatcaggt agaccaaatc 600 
aaagttatcg^ cccctgatca gttagaggat ttgtacgtga ttcaaggaga agatcacgtc 660 
accctattaa cttgcacacc ttatatgata aatagtcatc gcctcctcgt tcgaggcaag 720 
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cgaattcctt atgtggaaaa aacagtgcag aaagattcaa agaccttcag gcaacaacaa 780 
tacctaacct atgctatgtg ggtagtcgtt ggacttatct tgctgtcgct tctcatttgg 8 40 
tttaaaaaga cgaaacagaa aaagcggaga aagaatgaaa aagcggctag tcaaaatagt 900 

918 

cacaataatt cgaaataa 



<210> 6 
<211> 305 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 6 

Met Gly Gin Lys Ser Lys He Ser Leu Ala Thr Asn He Arg He Trp 
1 5 10 15 

He Phe Arg Leu He Phe Leu Ala Gly Phe Leu Val Leu Ala Phe Pro-- 

20 25 30 

He Val Ser Gin Val Met Tyr Phe Gin Ala Ser His Ala Asn He Asn 

35 40 45 

Ala Phe Lys Glu Ala Val Thr Lys He Asp Arg Val Glu He Asn Arg 
50 55 60 

Arq Leu Glu Leu Ala Tyr Ala Tyr Asn Ala Ser He Ala Gly Ala Lys 

m 75 80 

65 70 10 

Thr Asn Gly Glu Tyr Pro Ala Leu Lys Asp Pro Tyr Ser Ala Glu Gin 

85 90 95 

Lys Gin Ala Gly Val Val Glu Tyr Ala Arg Met Leu Glu Val Lys Glu 

100 105 11° 

Gin He Gly His Val He He Pro Arg He Asn Gin Asp He Pro He 
115 120 125 

Tyr Ala Gly Ser Ala Glu Glu Asn Leu Gin Arg Gly Val Gly His Leu 
130 135 140 

Glu Gly Thr Ser Leu Pro Val Gly Gly Glu Ser Thr His Ala Val Leu 

ten 155 160 

145 I 50 133 

Thr Ala His Arg Gly Leu Pro Thr Ala Lys Leu Phe Thr Asn Leu Asp 

165 170 175 
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Lys Val Thr Val Gly Asp Arg Phe Tyr He Glu His He Gly Gly Lys 

180 185 190 

He Ala Tyr Gin Val Asp Gin He Lys Val He Ala Pro Asp Gin Leu 
195 200 205 

Glu Asp Leu Tyr Val He Gin Gly Glu Asp His Val Thr Leu Leu Thr 
210 215 220 

Cys Thr Pro Tyr Met He Asn Ser His Arg Leu Leu Val Arg Gly Lys 
225 230 235 240 

Arg He Pro Tyr Val Glu Lys Thr Val Gin Lys Asp Ser Lys Thr Phe 

245 250 255 

Arg Gin Gin Gin Tyr Leu Thr Tyr Ala Met Trp Val Val Val Gly Leu 

260 265 270 

He Leu Leu Ser Leu Leu He Trp Phe Lys Lys Thr Lys Gin Lys Lys^ 
275 280 285 

Arg Arg Lys Asn Glu Lys Ala Ala Ser Gin Asn Ser His Asn Asn Ser 
290 295 300 

Lys 
305 



<210> 7 
<211> 852 
<212> DNA 

<213> Streptococcus agalactiae 



<400> 7 

atgaaaaagc ggctagtcaa aatagtcaca ataattcgaa ataataaaat cagaaccctc 60 
atttttgtga tgggaagtct gattctctta tttccgattg tgagccaggt aagttactac 120 
cttgettcgc atcaaaatat taatcaattt aagcgggaag tcgctaagat tgatactaat 180 
acggttgaac gacgcatcgc tttagctaat gcttacaatg agacgttatc aaggaatccc 240 
ttgcttatag acccttttac cagtaagcaa aaagaaggtt tgagagagta tgctcgtatg 300 
cttgaagttc atgagcaaat aggtcatgtg gcaatcccaa gtattggggt tgatattcca 360 
atttatgctg gaacatccga aactgtgctt cagaaaggta gtgggcattt ggagggaacc 420 
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agtcttccag tgggaggttt gtcaacccat tcagtactaa ctgcccaccg tggcttgcca 480 
acagctaggc tatttaccga cttaaataaa gttaaaaaag gccagatttt ctatgtgacg 540 
aacatcaagg aaacacttgc ctacaaagtc gtgtctatca aagttgtgga tccaacagct 600 
ttaagtgagg ttaagattgt caatggtaag gattatataa ccttgctgac ttgcacacct 660 
tacatgatca atagtcatcg tctcttggta aaaggagagc gtattcctta tgattctacc 720 
gaggcggaaa agcacaaaga acaaaccgta caagattatc gtttgtcact agtgttgaag 780 
atactactag tattattaat tggactcttc atcgtgataa tgatgagaag atggatgcaa 840 
catcgtcaat aa 852 



<210> 8 
<211> 283 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 8 

Met Lys Lys Arg Leu Val Lys lie Val Thr He He Arg Asn Asn Lys 
15 10 15 

He Arg Thr Leu He Phe Val Met Gly Ser Leu He Leu Leu Phe Pro 

20 25 30 

He Val Ser Gin Val Ser Tyr Tyr Leu Ala Ser His Gin Asn He Asn 

35 40 45 

Gin Phe Lys Arg Glu Val Ala Lys He Asp Thr Asn Thr Val Glu Arg 
50 55 60 

Arg He Ala Leu Ala Asn Ala Tyr Asn Glu Thr Leu Ser Arg Asn Pro 
65 70 75 80 

Leu Leu He Asp Pro Phe Thr Ser Lys Gin Lys Glu Gly Leu Arg Glu 

85 90 95 

Tyr Ala Arg Met Leu Glu Val His Glu Gin He Gly His Val Ala He 

100 105 HO 

Pro Ser He Gly Val Asp He Pro He Tyr Ala Gly Thr Ser Glu Thr 
115 120 125 



8 
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Val Leu Gin Lys Gly Ser Gly His Leu Glu Gly Thr Ser Leu Pro Val 
130 135 140 

Gly Gly Leu Ser Thr His Ser Val Leu Thr Ala His Arg Gly Leu Pro 
14 5 150 155 160 

Thr Ala Arg Leu Phe Thr Asp Leu Asn Lys Val Lys Lys Gly Gin lie 

165 170 175 

Phe Tyr Val Thr Asn lie Lys Glu Thr Leu Ala Tyr Lys Val Val Ser 

180 185 190 

He Lys Val Val Asp Pro Thr Ala Leu Ser Glu Val Lys He Val Asn 
195 200 205 

Gly Lys Asp Tyr He Thr Leu Leu Thr Cys Thr Pro Tyr Met He Asn 
210 215 220 

Ser His Arg Leu Leu Val Lys Gly Glu Arg lie Pro Tyr Asp Ser Thr 
225 230 235 240 

Glu Ala Glu Lys His Lys Glu Gin Thr Val Gin Asp Tyr Arg Leu Ser 

245 250 255 

Leu Val Leu Lys He Leu Leu Val Leu Leu He Gly Leu Phe He Val 

260 265 270 

He Met Met Arg Arg Trp Met Gin His Arg Gin 
275 280 



<210> 9 
<211> 2712 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 9 

atgatgattg tgaataatgg ttatctagaa gggagaaaaa tgaaaaagag acaaaaaata 60 
tggagagggt tatcagttac tttactaatc ctgtcccaaa ttccatttgg tatattggta 120 
caaggtgaaa cccaagatac caatcaagca cttggaaaag taattgttaa aaaaacggga 180 
gacaatgcta caccattagg caaagcgact tttgtgttaa aaaatgacaa tgataagtca 240 
gaaacaagtc acgaaacggt agagggttct ggagaagcaa cctttgaaaa cataaaacct 300 
ggagactaca cattaagaga agaaacagca ccaattggtt ataaaaaaac tgataaaacc 360 

9 
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tggaaagtta aagttgcaga taacggagca 
gcagagaaac gaaaagaagt tttgaatgcc 
acaaaagaaa attacccatt agttaatgta 
gcattgaatc caataaatgg aaaagatggt 
aaaaaaaata caggggtcaa tgatctcgat 
gagggtaaaa ccactgttga aacgaaagaa 
ttagataatt caaatagtat gaataatgaa 
gctggggaag cagttgaaaa gctgattgat 
gctcttgtga catatgcctc aaccattttt 
gttgccgatc aaaatggtaa agcgctgaat 
acttttacag caactacaca taattacagt 
gttaatattc taaagtcaag aattccaaag 
ctctatcaat ttggtgcgac atttactcaa 
gagacacaaa gttctaatgc tagaaaaaaa 
acgatgtctt atgccataaa ttttaatcct 
aattcttttt taaataaaat accagataga 
aatggtgatg attatcaaat agtaaaagga 
agaaaagttc ctgttactgg aggaacgaca 
ctctctgtaa tgagtaatga gggatatgca 
agagattaca actgggtcta tccatttgat 
caaatcaaaa ctcatggtga gccaacaaca 
ggttatgaca tttttactgt tgggattggt 
gaagctgaga aatttatgca atcaatatca 
gatacaaata aaatttatga tgagctaaat 

10 
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acaataatcg agggtatgga tgcagataaa 420 
caatatccaa aatcagctat ttatgaggat 480 
gagggttcca aagttggtga acaatacaaa 54 0* 
cgaagagaga ttgctgaagg ttggttatca 600 
aagaataaat ataaaattga attaactgtt 660 
cttaatcaac cactagatgt cgttgtgcta 720 
agagccaata attctcaaag agcattaaaa 780 
aaaattacat caaataaaga caatagagta 840 
gatggtactg aagcgaccgt atcaaaggga 900 
gatagtgtat catgggatta tcataaaact 960 
tatttaaatt taacaaatga tgctaacgaa 1020 
gaagcggagc atataaatgg ggatcgcacg 108 0 
aaagctctaa tgaaagcaaa tgaaatttta 1140 
cttatttttc acgtaactga tggtgtccct 1200 
tatatatcaa catcttacca aaaccagttt 1260 
agtggtattc tccaagagga ttttataatc 1320 
gatggagaga gttttaaact gttttcggat 1380 
caagcagctt atcgagtacc gcaaaatcaa 1440 
attaatagtg gatatattta tctctattgg 1500 
cctaagacaa agaaagtttc tgcaacgaaa 1560 
ttatacttta atggaaatat aagacctaaa 1620 
gtaaacggag atcctggtgc aactcctctt 1680 
agtaaaacag aaaattatac taatgttgat 1740 
aaatacttta aaacaattgt tgaggaaaaa 1800 
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cattctattg ttgatggaaa tgtgactgat 
aaaaatggtc aaagttttac acatgatgat 
ttaaaaaatg gtgtggctct tggtggacca 
acagtgactt atgataagac atctcaaacc 
ggacaaaaag tagttcttac ctatgatgta 
ttttacaata caaataatcg tacaacgcta 
attcgtgatt tcccaattcc caaaattcgt 
agtaatcaga agaaaatggg tgaggttgaa 
gaatcgcttt tgggagctaa gtttcaactt 
caatttgttc cagagggaag tgatgttaca 
gcacttcaag atggtaacta taaattatat 
gttaaaacga aacctgttgt gacatttaca 
gcagatccaa atgctaataa aaatcaaatc 
attaccaaca ctcccaaacg cccaccaggt 
attgtctata tattagttgg ttctactttt 
aaacaattgt aa 

<210> 10 
<211> 903 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 10 

Met Met lie Val Asn Asn Gly Tyr 
1 5 

Arg Gin Lys lie Trp Arg Gly Leu 

20 

Gin lie Pro Phe Gly lie Leu Val 

35 40 
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cctatgggag agatgattga attccaatta 18 60 
tacgttttgg ttggaaatga tggcagtcaa 1920 
aacagtgatg ggggaatttt aaaagatgtt 1980 
atcaaaatca atcatttgaa cttaggaagt 2040 
cgtttaaaag ataactatat aagtaacaaa 2100 
agtccgaaga gtgaaaaaga accaaatact 2160 
gatgttcgtg agtttccggt actaaccatc 2220 
tttattaaag ttaataaaga caaacattca 2280 
cagatagaaa aagatttttc tgggtataag 2340 
acaaagaatg atggtaaaat ttattttaaa 24 00 
gaaatttcaa gtccagatgg ctatatagag 24 60 
attcaaaatg gagaagttac gaacctgaaa 2520 
gggtatcttg aaggaaatgg taaacatctt 2580 
gtttttccta aaacaggggg aattggtaca 2640 
atgatactta ccatttgttc tttccgtcgt 2700 

2712 



Leu Glu Gly Arg Lys Met Lys Lys 
10 15 

Ser Val Thr Leu Leu lie Leu Ser 
25 30 

Gin Gly Glu Thr Gin Asp Thr Asn 

45 
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Gin Ala Leu Gly 
50 

Pro Leu Gly Lys 
65 

Glu Thr Ser His 



Asn lie Lys Pro 

100 

Gly Tyr Lys Lys 
115 

Gly Ala Thr lie 
130 

Lys Glu Val Leu 
145 

Thr Lys Glu Asn 



Glu Gin Tyr Lys 

180 

Glu lie Ala Glu 
195 

Leu Asp Lys Asn 
210 

Thr Val Glu Thr 
225 

Leu Asp Asn Ser 



Arg Ala Leu Lys 

260 

Thr Ser Asn Lys 
275 

lie Phe Asp Gly 
290 



Lys Val lie Val 

55 

Ala Thr Phe Val 
70 

Glu Thr Val Glu 
85 

Gly Asp Tyr Thr 



Thr Asp Lys Thr 

120 

lie Glu Gly Met 
135 

Asn Ala Gin Tyr 
150 

Tyr Pro Leu Val 
165 

Ala Leu Asn Pro 



Gly Trp Leu Ser 

200 

Lys Tyr Lys lie 
215 

Lys Glu Leu Asn 
230 

Asn Ser Met Asn 
245 

Ala Gly Glu Ala 



Asp Asn Arg Val 

280 

Thr Glu Ala Thr 
295 



Lys Lys Thr Gly 

60 

Leu Lys Asn Asp 

75 

Gly Ser Gly Glu 
90 

Leu Arg Glu Glu 
105 

Trp Lys Val Lys 



Asp Ala Asp Lys 

140 

Pro Lys Ser Ala 
155 

Asn Val Glu Gly 
170 

lie Asn Gly Lys 
185 

Lys Lys Asn Thr 



Glu Leu Thr Val 

220 

Gin Pro Leu Asp 
235 

Asn Glu Arg Ala 
250 

Val Glu Lys Leu 
265 

Ala Leu Val Thr 



Val Ser Lys Gly 

300 



Asp Asn Ala Thr 



Asn Asp Lys Ser 

80 

Ala Thr Phe Glu 

95 

Thr Ala Pro lie 
110 

Val Ala Asp Asn 
125 

Ala Glu Lys Arg 



lie Tyr Glu Asp 

160 

Ser Lys Val Gly 
175 

Asp Gly Arg Arg 
190 

Gly Val Asn Asp 
205 

Glu Gly Lys Thr 



Val Val Val Leu 

240 

Asn Asn Ser Gin 
255 

lie Asp Lys lie 
270 

Tyr Ala Ser Thr 
285 

Val Ala Asp Gin 
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Asn Gly Lys Ala 
305 

Thr Phe Thr Ala 



Asp Ala Asn Glu 

340 

Glu His lie Asn 
355 

Thr Gin Lys Ala 
370 



Leu Asn Asp Ser 
310 

Thr Thr His Asn 
325 

Val Asn lie Leu 



Gly Asp Arg Thr 

360 

Leu Met Lys Ala 
375 



Val Ser Trp Asp 
315 

Tyr Ser Tyr Leu 
330 

Lys Ser Arg lie 
345 

Leu Tyr Gin Phe 



Asn Glu lie Leu 

380 



Tyr His Lys Thr 

320 

Asn Leu Thr Asn 
335 

Pro Lys Glu Ala 
350 

Gly Ala Thr Phe 
365 

Glu Thr Gin Ser 



Ser Asn Ala Arg 
385 

Thr Met Ser Tyr 



Gin Asn Gin Phe 

420 

lie Leu Gin Glu 
435 

Lys Gly Asp Gly 
450 

Val Thr Gly Gly 
465 

Leu Ser Val Met 



Tyr Leu Tyr Trp 

500 

Thr Lys Lys Val 
515 

Thr Thr Leu Tyr 
530 

Phe Thr Val Gly 
545 



Lys Lys Leu lie 
390 

Ala lie Asn Phe 
405 

Asn Ser Phe Leu 



Asp Phe lie lie 

440 

Glu Ser Phe Lys 
455 

Thr Thr Gin Ala 
470 

Ser Asn Glu Gly 
485 

Arg Asp Tyr Asn 



Ser Ala Thr Lys 

520 

Phe Asn Gly Asn 
535 

lie Gly Val Asn 
550 



Phe His Val Thr 
395 

Asn Pro Tyr lie 
410 

Asn Lys lie Pro 
425 

Asn Gly Asp Asp 



Leu Phe Ser Asp 

460 

Ala Tyr Arg Val 
475 

Tyr Ala lie Asn 
490 

Trp Val Tyr Pro 
505 

Gin lie Lys Thr 



lie Arg Pro Lys 

540 

Gly Asp Pro Gly 
555 



Asp Gly Val Pro 

400 

Ser Thr Ser Tyr-' 
_ 415 

Asp Arg Ser Gly 
430 

Tyr Gin lie Val 
445 

Arg Lys Val Pro 



Pro Gin Asn Gin 

480 

Ser Gly Tyr lie 
495 

Phe Asp Pro Lys 
510 

His Gly Glu Pro 
525 

Gly Tyr Asp lie 



Ala Thr Pro Leu 

560 
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Glu Ala Glu Lys 



Thr Asn Val Asp 

580 

Phe Lys Thr He 
595 

Thr Asp Pro Met 
610 



Phe Met Gin Ser 
565 

Asp Thr Asn Lys 



Val Glu Glu Lys 

600 

Gly Glu Met He 
615 



He Ser Ser Lys 
570 

He Tyr Asp Glu 
585 

His Ser lie Val 



Glu Phe Gin Leu 

620 



Thr Glu Asn Tyr 
575 

Leu Asn Lys Tyr 
590 

Asp Gly Asn Val 
605 

Lys Asn Gly Gin 



Ser Phe Thr His 
625 

Leu Lys Asn Gly 



Leu Lys Asp Val 

660 

He Asn His Leu 
675 



Asp Asp Tyr Val 
630 

Val Ala Leu Gly 
645 

Thr Val Thr Tyr 



Asn Leu Gly Ser 

680 



Leu Val Gly Asn 
635 

Gly Pro Asn Ser 
650 

Asp Lys Thr Ser 
665 

Gly Gin Lys Val 



Asp Gly Ser Gin 

640 

Asp Gly Gly He 
655 

Gin Thr lie Lys 
670 

Val Leu Thr Tyr 
685 



Asp Val Arg Leu 
690 

Asn Asn Arg Thr 
705 

He Arg Asp Phe 



Val Leu Thr He 

740 

Lys Val Asn Lys 



Lys Asp Asn Tyr 
695 

Thr Leu Ser Pro 
710 

Pro lie Pro Lys 
725 

Ser Asn Gin Lys 



Asp Lys His Ser 

760 



lie Ser Asn Lys 

700 

Lys Ser Glu Lys 
715 

He Arg Asp Val 
730 

Lys Met Gly Glu 
745 

Glu Ser Leu Leu 



Phe Tyr Asn Thr 



Glu Pro Asn Thr 

720 

Arg Glu Phe Pro 
735 

Val Glu Phe lie 
750 

Gly Ala Lys Phe 
765 



Gin Leu Gin He 
770 

Glu Gly Ser Asp 
785 

Ala Leu Gin Asp 



Glu Lys Asp Phe 
775 

Val Thr Thr Lys 
7 90 

Gly Asn Tyr Lys 
805 



Ser Gly Tyr Lys 

780 

Asn Asp Gly Lys 
7 95 

Leu Tyr Glu lie 
810 



Gin Phe Val Pro 



He Tyr Phe Lys 

800 

Ser Ser Pro Asp 
815 
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Gly Tyr lie Glu Val Lys Thr Lys 

820 

Asn Gly Glu Val Thr Asn Leu Lys 
835 840 

Gin lie Gly Tyr Leu Glu Gly Asn 
850 855 

Pro Lys Arg Pro Pro Gly Val Phe 
865 870 

lie Val Tyr lie Leu Val Gly Ser 

885 

Ser Phe Arg Arg Lys Gin Leu 

900 



<210> 11 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
oligonucleotide 

<400> 11 

ctaggtggat ccttcggcaa t 

<210> 12 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
oligonucleotide 

<400> 12 
cgattgccga 



Pro Val Val Thr Phe Thr He Gin 
825 830 

Ala Asp Pro Asn Ala Asn Lys Asn 

845 

Gly Lys His Leu He Thr Asn Thr 

860 

Pro Lys Thr Gly Gly lie Gly Thr 
875 880 

Thr Phe Met He Leu Thr He Cys 
890 895 



Sequence : 



21 



Sequence : 



10 



<210> 13 
<211> 25 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 13 

aggcaactgt gctaaccgag ggaat 



<210> 14 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 14 
cgattccctc g 



<210> 15 
<211> 1509 
<212> DNA 

<213> Streptococcus agalactiae 

<220> 

<221> CDS 

<222> (1) . . (1509) 

<400> 15 

atg aaa aag aaa atg att caa teg ctg 
Met Lys Lys Lys Met lie Gin Ser Leu 
1 5 

ggt atg get gta tea cca gtt acg ccg 
Gly Met Ala Val Ser Pro Val Thr Pro 

20 25 

ggg aca att aca gtt caa gat act caa 
Gly Thr lie Thr Val Gin Asp Thr Gin 

35 40 

tat aaa gtt ttt gat gca gaa ata gat 
Tyr Lys Val Phe Asp Ala Glu lie Asp 

16 



tta gtg gcg agt tta gca ttt 48 
Leu Val Ala Ser Leu Ala Phe 
10 15 

ata get ttt gee get gag aca 96 
lie Ala Phe Ala Ala Glu Thr 

30 

aaa ggc gca ace tat aaa gca 144 
Lys Gly Ala Thr Tyr Lys Ala 

45 

aat gca aat gta tct gat teg 192 
Asn Ala Asn Val Ser Asp Ser 
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50 55 60 

aat aaa gat gga get tct tat tta att cct caa ggt aaa gaa get gag 240 
Asn Lys Asp Gly Ala Ser Tyr Leu lie Pro Gin Gly Lys Glu Ala Glu 
65 70 75 80 

tat aaa get tea act gat ttt aat tct ctt ttt acg aca act act aat 288 
Tyr Lys Ala Ser Thr Asp Phe Asn Ser Leu Phe Thr Thr Thr Thr Asn 

85 90 95 

gga ggg aga aca tat gta act aaa aaa gat act gcg tea gca aat gag 336 
Gly Gly Arg Thr Tyr Val Thr Lys Lys Asp Thr Ala Ser Ala Asn Glu 

100 105 110 

att gcg aca tgg get aaa tct ata tea get aat act aca cca gtt tec 384 
He Ala Thr Trp Ala Lys Ser He Ser Ala Asn Thr Thr Pro Val Ser 
115 120 125 

act gtt act gag tea aat aat gat ggt act gag gtt att aat gtt tec 432 
Thr Val Thr Glu Ser Asn Asn Asp Gly Thr Glu Val He Asn Val Ser-* 
130 135 140 

caa tat gga tat tat tat gtt tct age act gtt aat aat gga get gta 480 
Gin Tyr Gly Tyr Tyr Tyr Val Ser Ser Thr Val Asn Asn Gly Ala Val 
145 150 155 160 

att atg gtt aca tct gta act cca aat get act att cat gaa aag aat 528 
He Met Val Thr Ser Val Thr Pro Asn Ala Thr He His Glu Lys Asn 

165 170 175 

act gat gcg aca tgg gga gat ggt ggt gga aaa act gta gat caa aaa 57 6 
Thr Asp Ala Thr Trp Gly Asp Gly Gly Gly Lys Thr Val Asp Gin Lys 

180 185 190 

acg tac teg gtt ggt gat aca gtc aaa tat act att act tat aag aat 624 
Thr Tyr Ser Val Gly Asp' Thr Val Lys Tyr Thr He Thr Tyr Lys Asn 
195 200 205 

gca gtc aat tat cat ggt aca gaa aaa gtg tat caa tat gtt ata aag 672 
Ala Val Asn Tyr His Gly Thr Glu Lys Val Tyr Gin Tyr Val He Lys 

210 215 220 

gat act atg cca tct get tct gta gtt gat ttg aac gaa ggg tct tat 720 
Asp Thr Met Pro Ser Ala Ser Val Val Asp Leu Asn Glu Gly Ser Tyr 
225 230 235 240 

gaa gta act att act gat gga tea ggg aat att aca act eta act caa 7 68 
Glu Val Thr lie Thr Asp Gly Ser Gly Asn He Thr Thr Leu Thr Gin 
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245 250 255 

ggt teg gaa aaa gca act ggg aag tat aac ctg tta gag gaa aat aat 816 

Gly Ser Glu Lys Ala Thr Gly Lys Tyr Asn Leu Leu Glu Glu Asn Asn 

260 265 270 

aat ttc acg att act att ccg tgg gca get acc aat act cca acc gga 8 64 

Asn Phe Thr lie Thr lie Pro Trp Ala Ala Thr Asn Thr Pro Thr Gly 
275 280 285 

aat act caa aat gga get aat gat gac ttt ttt tat aag gga ata aat 912 

Asn Thr Gin Asn Gly Ala Asn Asp Asp Phe Phe Tyr Lys Gly lie Asn 
290 295 300 

aca ate aca gtc act tat aca gga gta tta aag agt gga get aaa cca 960 

Thr lie Thr Val Thr Tyr Thr Gly Val Leu Lys Ser Gly Ala Lys Pro 

305 310 315 320 

ggt tea get gat tta cca gaa aat aca aac att gcg acc ate aac ccc 1008 

Gly Ser Ala Asp - Leu Pro Glu Asn Thr Asn lie Ala Thr He Asn Pre 

325 330 _ 335 

aat act age aat gat gac cca ggt caa aaa gta aca gtg agg gat ggt 1056 

Asn Thr Ser Asn Asp Asp Pro Gly Gin Lys Val Thr Val Arg Asp Gly 

340 345 350 

caa att act ata aaa aaa att gat ggt tec aca' aaa get tea tta caa 1104 

Gin He Thr He Lys Lys He Asp Gly Ser Thr Lys Ala Ser Leu Gin 
355 360 365 

ggt get ata ttt gtt tta aag aat get acg ggt caa ttt eta aac ttt 1152 

Gly Ala He Phe Val Leu Lys Asn Ala Thr Gly Gin Phe Leu Asn Phe 
370 375 380 

aac gat aca aat aac gtt gaa tgg ggc aca gaa get aat gca aca gaa 1200 

Asn Asp Thr Asn Asn Val Glu Trp Gly Thr Glu Ala Asn Ala Thr Glu 

385 390 395 400 

tat aca aca gga gca gat ggt ata att acc att aca ggc ttg aaa gaa 1248 

Tyr Thr Thr Gly Ala Asp Gly He lie Thr He Thr Gly Leu Lys Glu 

405 410 415 

ggt aca tac tat eta gtt gag aaa aag get ccc tta ggt tac aat ttg 1296 

Gly Thr Tyr Tyr Leu Val Glu Lys Lys Ala Pro Leu Gly Tyr Asn Leu 

420 425 430 

tta gat aac tct cag aag gtt att tta gga gat gga gec act gat acg 1344 

Leu Asp Asn Ser Gin Lys Val He Leu Gly Asp Gly Ala Thr Asp Thr 
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435 440 445 

act aat tea gat aac ctt tta gtt aac cca act gtt gaa aat aac aaa 1392 
Thr Asn Ser Asp Asn Leu Leu Val Asn Pro Thr Val Glu Asn Asn Lys 
450 455 • 460 

ggt act gag ttg cct tea aca ggt ggt att ggt aca aca att ttc tac 1440 
Gly Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr Thr lie Phe Tyr 
465 470 475 480 

att ata ggt gca att tta gta ata gga gca ggt ate gtg ctt gtt get 1488 
lie lie Gly Ala lie Leu Val lie Gly Ala Gly lie Val Leu Val Ala 

485 490 495 

cgt cgt cgt tta cgt tct taa 1509 
Arg Arg Arg Leu Arg Ser 

500 



<210> 16 
<211> 502 
<212> PRT 

<213> Streptococcus agalactiae 



<400> 16 

Met Lys Lys Lys Met lie Gin Ser Leu Leu Val Ala Ser Leu Ala Phe 
1 5 10 15 

Gly Met Ala Val Ser Pro Val Thr Pro lie Ala Phe Ala Ala Glu Thr 

20 25 30 

Gly Thr lie Thr Val Gin Asp Thr Gin Lys Gly Ala Thr Tyr Lys Ala 

35 40 45 

Tyr Lys Val Phe Asp Ala Glu lie Asp Asn Ala Asn Val Ser Asp Ser 
50 55 60 

Asn Lys Asp Gly Ala Ser Tyr Leu lie Pro Gin Gly Lys Glu Ala Glu 
65 70 75 80 

Tyr Lys Ala Ser Thr Asp Phe Asn Ser Leu Phe Thr Thr Thr Thr Asn 

85 90 95 

Gly Gly Arg Thr Tyr Val Thr Lys Lys Asp Thr Ala Ser Ala Asn Glu 

100 105 HO 

lie Ala Thr Trp Ala Lys Ser lie Ser Ala Asn Thr Thr Pro Val Ser 
115 120 125 

19 
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Thr Val Thr 
130 

Gin Tyr Gly 
145 

He Met Val 



Thr Asp Ala 



Thr Tyr Ser 
195 

Ala Val Asn 
210 

Asp Thr Met 
225 

Glu Val Thr 



Gly Ser Glu 



Asn Phe Thr 
275 

Asn Thr Gin 

290 

Thr He Thr 
305 

Gly Ser Ala 



Asn Thr Ser 



Gin He Thr 
355 

Gly Ala He 
370 



Glu Ser Asn 



Tyr Tyr Tyr 
150 

Thr Ser Val 
165 

Thr Trp Gly 
180 

Val Gly Asp 



Tyr His Gly 



Pro Ser Ala 
230 

He Thr Asp 
245 

Lys Ala Thr 
260 

He Thr lie 



Asn Gly Ala 



Val Thr Tyr 
310 

Asp Leu Pro 
325 

Asn Asp Asp 
340 

He Lys Lys 



Phe Val Leu 



Asn Asp Gly Thr 
135 

Val Ser Ser Thr 



Thr Pro Asn Ala 

170 

Asp Gly Gly Gly 
185 

Thr Val Lys Tyr 
200 

Thr Glu Lys Val 
215 

Ser Val Val Asp 



Gly Ser Gly Asn 

250 

Gly Lys Tyr Asn 
265 

Pro Trp Ala Ala 
280 

Asn "Asp Asp Phe 
295 

Thr Gly Val Leu 



Glu Asn Thr Asn 

330 

Pro Gly Gin Lys 
345 

He Asp Gly Ser 
360 

Lys Asn Ala Thr 
375 



Glu Val He 
140 

Val Asn Asn 
155 

Thr He His 



Lys Thr Val 



Thr He Thr 
205 

Tyr Gin Tyr 
220 

Leu Asn Glu 
235 

He Thr Thr 



Leu Leu Glu 



Thr Asn Thr 
285 

Phe Tyr Lys 
300 

Lys Ser Gly 
315 

He Ala Thr 



Val Thr Val 

i 

Thr Lys Ala 
365 

Gly Gin Phe 
380 



Asn Val Ser 



Gly Ala Val 
160 

Glu Lys Asn 
175 

Asp Gin Lys 
190 

Tyr Lys Asn 



Val He Lys 



Gly Ser Tyr* 
240 

Leu Thr Gin 
255 

Glu Asn Asn 
270 

Pro Thr Gly 



Gly He Asn 



Ala Lys Pro 
320 

He Asn Pro 
335 

Arg Asp Gly 
350 

Ser Leu Gin 



Leu Asn Phe 
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Asn Asp Thr Asn 
385 

Tyr Thr Thr Gly 



Gly Thr Tyr Tyr 

420 

Leu Asp Asn Ser 
435 

Thr Asn Ser Asp 
450 

Gly Thr Glu Leu 
465 

lie lie Gly Ala 



Arg Arg Arg Leu 

500 



Asn Val Glu Trp 
390 

Ala Asp Gly lie 
405 

Leu Val Glu Lys 



Gin Lys Val lie 

440 

Asn Leu Leu Val 
455 

Pro Ser Thr Gly 
470 

lie Leu Val lie 
485 

Arg Ser 



Gly Thr Glu Ala 
395 

lie Thr lie Thr 
410 

Lys Ala Pro Leu 
425 

Leu Gly Asp Gly 



Asn Pro Thr Val 

460 

Gly lie Qly Thr 
475 

Gly Ala Gly lie 
490 



Asn Ala Thr Glu 

400 

Gly Leu Lys Glu 
415 

Gly Tyr Asn Leu 
430 

Ala Thr Asp Thr 
445 

Glu Asn Asn Lys 



Thr lie Phe Tyr 

480 

Val Leu Val Ala^ 
495 



<210> 17 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
<220> 

<223> X can be any amino acid 

<400> 17 c 
Leu Pro Xaa Thr Gly 
1 5 



<210> 18 
<211> 1683 
<212> DNA 

<213> Streptococcus agalactiae 
<220> 
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<221> CDS 

<222> (1) . . (1683) 



<400> 18 

atg gtg ate gta ttc egg att ata cag ata tta caa ggg att ata tec 48 

Met Val lie Val Phe Arg lie lie Gin lie Leu Gin Gly He He Ser 
15 10 15 



aag ate ctt cag gta cat att att ata agt atg att cac gag ata aag 96 
Lys He Leu Gin Val His He He He Ser Met He His Glu He Lys 

20 25 30 



ate ccg act caa eta aag atg cct att ata cga cag ata eta gtc tea 144 
He Pro Thr Gin Leu Lys Met Pro He He Arg Gin He Leu Val Ser 

35 40 45 



tea aat gtt gat aca aca act aag 
Ser Asn Val Asp Thr Thr Thr Lys 
50 55 

aaa tta gtc ggt tgg tat tat gtt 
Lys Leu Val Gly Trp Tyr Tyr Val 
65 70 



tac aag tac gta aaa gac get tac 192 

Tyr Lys Tyr Val Lys Asp Ala Tyr 

60 

aat cca tat ggt agt att aga cct 240 

Asn Pro Tyr Gly Ser He Arg Pro 

75 80 



tat aac ttt tea ggt get gta act caa gat ate aat tta aga get att 288 
Tyr Asn Phe Ser Gly Ala Val Thr Gin Asp He Asn Leu Arg Ala He 

85 90 95 



tgg cga aag get gga gat tat cat att ata tac age aat gat get gtt 336 

Trp Arg Lys Ala Gly Asp Tyr His He He Tyr Ser Asn Asp Ala Val 

100 105 110 

ggt aca gat gga aag cca gca ttg gat get tct ggt cag caa tta caa 384 

Gly Thr Asp Gly Lys Pro Ala Leu Asp Ala Ser Gly Gin Gin Leu Gin 
115 120 125 



aca agt aat gag cct act gac cct gat tec tat gac gat ggc tec cat 432 
Thr Ser Asn Glu Pro Thr Asp Pro Asp Ser Tyr Asp Asp Gly Ser His 
130 135 140 



tea gec tta ctg aga cgt ccg aca 

Ser Ala Leu Leu Arg Arg Pro Thr 
145 150 

ggc tgg tgg tac aat ggt aaa att 

Gly Trp Trp Tyr Asn Gly Lys He 

165 



atg cca gat ggc tat cgt ttc cgt 480 
Met Pro Asp Gly Tyr Arg Phe Arg 
155 160 

tat aac cca tat gat tec att gat 528 
Tyr Asn Pro Tyr Asp Ser He Asp 
170 175 
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att gac gcc cat tta gca gat get aat aaa aat ate acc ata aaa cct 576 
lie Asp Ala His Leu Ala Asp Ala Asn Lys Asn He Thr He Lys Pro 

180 185 190 

gtc att att cca gta gga gat ate aaa tta gaa gat acc tec ate aaa 624 
Val He He Pro Val Gly Asp He Lys Leu Glu Asp Thr Ser He Lys 
195 200 205 

tac aat ggt. aac ggt ggt act aga gta gaa aat ggt aat gtg gta aca 672 
Tyr Asn Gly Asn Gly Gly Thr Arg Val Glu Asn Gly Asn Val Val Thr 
210 215 220 

caa gtg gag aca ccg cgt atg gag ttg aat age aca act aca att cct 720 
Gin Val Glu Thr Pro Arg Met Glu Leu Asn Ser Thr Thr Thr He Pro 
225 230 235 240 

gaa aac caa tac ttt aca agg aca ggt tac aac ctt att ggt tgg cat 768 
Glu Asn Gin Tyr Phe Thr Arg Thr Gly Tyr Asn Leu He Gly Trp His 

245 250 255 



cat gat aag gat tta get gat aca gga cgt gtg gaa ttt aca gca ggt 
His Asp Lys Asp Leu Ala Asp Thr Gly Arg Val Glu Phe Thr Ala Gly 

260 265 270 

caa tea ata ggt att gat aac aac ctt gat gca aca aat acc tta tat 
Gin Ser He Gly He Asp Asn Asn Leu Asp Ala Thr Asn Thr Leu Tyr 
275 280 285 

get gtt tgg caa cct aaa gaa tac acc gtc gga gta agt aaa act gtc 
Ala Val Trp Gin Pro Lys Glu Tyr Thr Val Gly Val Ser Lys Thr Val 
290 295 300 

gtt gga eta gat gaa gat aag acg aaa gac ttc ttg ttt aat cca agt 
Val Gly Leu Asp Glu Asp Lys Thr Lys Asp Phe Leu Phe Asn Pro Ser 
305 310 315 320 

gaa acg ttg caa caa gag aat ttt ccg ctg aga gat ggt cag act aag 
Glu Thr Leu Gin Gin Glu Asn Phe Pro Leu Arg Asp Gly Gin Thr Lys 

325 330 335 

gaa ttt aaa gta cct tat gga act tct ata tea ata gat gaa caa gcc 
Glu Phe Lys Val Pro Tyr Gly Thr Ser He Ser He Asp Glu Gin Ala 

340 345 35 0 

tac gat gaa ttt aaa gta tct gag tea att aca gaa aaa aat eta gca 
Tyr Asp Glu Phe Lys Val Ser Glu Ser He Thr Glu Lys Asn Leu Ala 
355 360 365 



816 



864 



912 



960 



1008 



1056 



1104 
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act ggt gaa get gat aaa act tat gat get acc ggc tta caa tec ctg 1152 
Thr Gly Glu Ala Asp Lys Thr Tyr Asp Ala Thr Gly Leu Gin Ser Leu 
370 375 ' 380 



aca gtt tea gga gac gta gat att age ttt acc aat aca cgt ate aag 
Thr Val Ser Gly Asp Val Asp lie Ser Phe Thr Asn Thr Arg lie Lys 
385 390 395 400 



tta gca ggt gca gtt ttt gat att tat gaa tea gat get aat ggg aat 
Leu Ala Gly Ala Val Phe Asp He Tyr Glu Ser Asp Ala Asn Gly Asn 

420 425 430 



1200 



caa aaa gta cga eta cag aaa gtt aat gtc gaa aat gat aat aat ttt 12 4 8 
Gin Lys Val Arg Leu Gin Lys Val Asn Val Glu Asn Asp Asn Asn Phe 

405 410 415 



1296 



aaa get tea cat cct atg tat tea ggg ctg gtg aca aac gat aaa ggc 1344 
Lys Ala Ser His Pro Met Tyr Ser Gly Leu Val Thr Asn Asp Lys Gly 
435 440 445 

ttg tta tta gtg gat get aat aac tac etc agt ttg cca gta gga aaa 1392 
Leu Leu Leu Val Asp Ala Asn Asn Tyr Leu Ser Leu Pro Val Gly Lys 
450 455 460 

tac tac eta aca gag aca aag gee cct cca ggg tac eta eta cct aaa 1440 
Tyr Tyr Leu Thr Glu Thr Lys Ala Pro Pro Gly Tyr Leu Leu Pro Lys 
465 470 475 480 

aat gat gat ata tea gta tta gtg att tct acg gga gtt acc ttt gaa 1488 
Asn Asp Asp He Ser Val Leu Val He Ser Thr Gly Val Thr Phe Glu 

485 490 495 

caa aat ggt aat aat gcg aca cca ata aaa gag aat tta gtg gat gga 1536 
Gin Asn Gly Asn Asn Ala Thr Pro lie Lys Glu Asn Leu Val Asp Gly 

500 505 510 

agt aca gta tat act ttt aaa att act aac agt aaa gga aca gaa ttg 1584 
Ser Thr Val Tyr Thr Phe Lys He Thr Asn Ser Lys Gly Thr Glu Leu 
515 520 525 



cct agt act gga ggt att gga aca cac att tat ate eta gtt ggt tta 1632 

Pro Ser Thr Gly Gly lie Gly Thr His He Tyr He Leu Val Gly Leu 
530 535 540 

get tta get eta cca tea gga tta ata tta tac tat cga aaa aaa ata 1680 

Ala Leu Ala Leu Pro Ser Gly Leu He Leu Tyr Tyr Arg Lys Lys lie 

545 550 555 560 
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tga 



1683 



<210> 19 
<211> 560 
<212> PRT 

<213> Streptococcus agalactiae 



<400> 19 

Met Val lie Val Phe 
1 5 

Lys lie Leu Gin Val 

20 

lie Pro Thr Gin Leu 

35 

Ser Asn Val Asp Thr 
50 

Lys Leu Val Gly Trp 
65 

Tyr Asn Phe Ser Gly 

85 

Trp Arg Lys Ala Gly 

100 

Gly Thr Asp Gly Lys 
115 

Thr Ser Asn Glu Pro 
130 

Ser Ala Leu Leu Arg 
145 

Gly Trp Trp Tyr Asn 

165 

lie Asp Ala His Leu 

180 

Val lie lie Pro Val 
195 



Arg lie lie Gin lie Leu 

10 

His He He He Ser Met 

25 

Lys Met Pro He He Arg 

40 

Thr Thr Lys Tyr Lys Tyr 
55 

Tyr Tyr Val Asn Pro Tyr 
70 75 

Ala Val Thr Gin Asp He 

90 

Asp Tyr His He He Tyr 

105 

Pro Ala Leu Asp Ala Ser 
120 

Thr Asp Pro Asp Ser Tyr 
135 

Arg Pro Thr Met Pro Asp 
150 155 

Gly Lys He Tyr Asn Pro 

170 

Ala Asp Ala Asn Lys Asn 

185 

Gly Asp He Lys Leu Glu 
200 



Gin Gly He He Ser 

15 

He His Glu He Lys 

30 

Gin He Leu Val Ser 
45 

Val Lys Asp Ala Tytf 

60 

Gly Ser He Arg Pro 

80 

Asn Leu Arg Ala He 

95 

Ser Asn Asp Ala Val 
110 

Gly Gin Gin Leu Gin 
125 

Asp Asp Gly Ser His 
140 

Gly Tyr Arg Phe Arg 

160 

Tyr Asp Ser He Asp 

175 

He Thr He Lys Pro 
190 

Asp Thr Ser He Lys 
205 
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Tyr Asn Gly Asn 
210 

Gin Val Glu Thr 

225 

Glu Asn Gin Tyr 



His Asp Lys Asp 

260 

Gin Ser lie Gly 
275 

Ala Val Trp Gin 
290 

Val Gly Leu Asp 
305 

Glu Thr Leu Gin 



Glu Phe Lys Val 

340 

Tyr Asp Glu Phe 
355 

Thr Gly Glu Ala 
370 

Thr Val Ser Gly 
385 

Gin Lys Val Arg 



Leu Ala Gly Ala 

420 

Lys Ala Ser His 
435 

Leu Leu Leu Val 
450 



Gly Gly Thr Arg 
215 

Pro Arg Met Glu 
230 

Phe Thr Arg Thr 
245 

Leu Ala Asp Thr 



lie Asp Asn Asn 

280 

Pro Lys Glu Tyr 
295 

Glu Asp Lys Thr 
310 

Gin Glu Asn Phe 
325 

Pro Tyr Gly Thr 



Lys Val Ser Glu 

360 

Asp Lys Thr Tyr 
375 

Asp Val Asp lie 
3 90- 

Leu Gin Lys Val 
405 

Val Phe Asp lie 



Pro Met Tyr Ser 

440 

Asp Ala Asn Asn 
455 



Val Glu Asn Gly 

220 

Leu Asn Ser Thr 
235 

Gly Tyr Asn Leu 

250 

Gly Arg Val Glu 
265 

Leu Asp Ala Thr 



Thr Val Gly Val 

300 

Lys Asp Phe Leu 
315 

Pro Leu Arg Asp 
330 

Ser lie Ser lie 
345 

Ser lie Thr Glu 



Asp Ala Thr Gly 

380 

Ser Phe Thr Asn 
395 

Asn Val Glu Asn 
410 

Tyr Glu Ser Asp 
425 

■ 

Gly Leu Val Thr 



Tyr Leu Ser Leu 

460 



Asn Val Val Thr 



Thr Thr lie Pro 

240 

lie Gly Trp His 
255 

Phe Thr Ala Gly 
270 

Asn Thr Leu Tyr 
285 

Ser Lys Thr Val 



Phe Asn Pro Ser"' 

320 

Gly Gin Thr Lys 
335 

Asp Glu Gin Ala 
350 

Lys Asn Leu Ala 
365 

Leu Gin Ser Leu 



Thr Arg lie Lys 

400 

Asp Asn Asn Phe 
415 

Ala Asn Gly Asn 
430 

Asn Asp Lys Gly 
445 

Pro Val Gly Lys 
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Tyr Tyr Leu Thr Glu Thr Lys Ala Pro Pro Gly Tyr Leu Leu Pro Lys 

465 470 475 480 

Asn Asp Asp lie Ser Val Leu Val He Ser Thr Gly Val Thr Phe Glu 

485 490 495 

Gin Asn Gly Asn Asn Ala Thr. Pro He Lys Glu Asn Leu Val Asp Gly 

500 505 510 

i 

Ser Thr Val Tyr Thr Phe Lys He Thr Asn Ser Lys Gly Thr Glu Leu 

515 520 525 

Pro Ser Thr Gly Gly He Gly Thr His He Tyr He Leu Val Gly Leu 

530 535 540 

Ala Leu Ala Leu Pro Ser Gly Leu He Leu Tyr Tyr Arg Lys Lys He 

545 550 555 560 



<210> 20 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
<400> 20 

Leu Pro Ser Thr Gly Gly 
1 5 



<210> 21 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
<220> 

<223> X can be any amino acid. 
<400> 21 

Xaa Pro Xaa Thr Gly Gly 
1 5 
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<210> 22 
<211> 2714 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 22 

caatcagaaa ttaccacgtg gcaatgttga 
ctctcttcaa ggggcaatgt tcaaagtcat 
tcttcaaaat ggtaaggaag tagttgtaac 
aggtctagag tatgggacat actatttatg 
attaacatcg cctgtttcct ttacaatcgg 
ggttaaaaat aacaagcgac cacggattga 
tatcttgatg cttgttgcca ttttgttgtt 
aaataactga tattcaatgt acatcattat 
gagtactctg aggtgatgtt aatcaggaat 
gatatgaggc tgggcagatt gtgccagcct 
gactggtctg gtaatcatt't taggaatgga 
gtgaatcaga aagaaatgag attttctcgt 
aaaagcgata aaatgatgag tttgaagata 
aaaagcaaaa acgaaataat ctcctattag 
tggcgtatcc gctggtgtct cgcttgtatt 
actttgataa ggaaaaagca acgttggatg 
cacaagcctt caatgactct ttgaataatg 
tgaagaaaaa agggcgagca gagtatgcac 
atgtggaaat ccccgttatt gacgtggatt 
tattgcagca aggggctggg catctagagg 
cccatgcggt gattacggca catacaggtt 

28 



ctttatgaag gtggatggtc ggaccaatac 60 
gaaagaagaa agcggacact atactcctgt 120 
atcagggaaa gatggtcgtt tccgagtgga 180 
ggagctccaa gctccaactg gttatgttca 240 
gaaagatact cgtaaggaac tggtaacagt 300 
tgtgccagat acaggggaag aaaccttgta 360 
tggtagtggt tattatctta cgaaaaaafcc 420 
gaaaaagata gcaggctgaa gggaagacca 480 
catggtgatg tggcatgaat cacaataacg 540 
cattgtgggt tattgtttgt aaaacgatag 600 
caggactggg attctgattt aaaatggatg 660 
ttctcttagc agataggatt gtctgttagg 720 
aagggatgct gataaaaatg gtaaaaacaa 780 
gagtggtatt tttcattgga atggcggtaa 840 
atcgagtgga atcaaatcaa caaattgctg 900 
aggctgacat tgatgaacga atgaaattgg 960 
tagtgagtgg cgatccttgg tcggaagaaa 1020 
gtatgttaga aatccatgag cggatggggc 1080 
tgccggttta tgctggtact gctgaagagg 1140 
gaacttctct gccgatcgga ggcaattcga 1200 
tgccaacagc taagatgttt acggatttga 1260 
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ccaaacttaa agttggggat aagttttatg tgcacaatat caaggaagtg atggcctatc 1320 

aagtggatca agtaaaggtg attgagccga cgaactttga tgatttattg attgtaccag 1380 

gtcatgatta tgtgaccttg ctgacttgta cgccatacat gatcaatacc catcgtctat 1440 

tggttcgggg gcatcggata ccgtacgtag cagaggttga ggaagaattt attgcagcaa 1500 

acaaactcag tcatctctat cgctacctgt tttatgtggc agttggtttg attgtgattc 15 60 

ttttatggat tattcgacgc ttgcgcaaga agaaaaaaca accggaaaag gctttgaagg 1620 

cgctgaaagc agcaaggaag gaagtgaagg tggaggatgg acaacagtag acgttcacga 1680 

aaaaaaggca caaaaaagaa gaaacatccg ctgatccttc ttctgatttt cttagtagga 1740 

ttcgccgttg cgatatatcc attggtgtct cgttattatt atcgtattga gtcaaacgag 1800 

gttattaaag agtttgatga gacggtttcc cagatggata aggcagaact tgaggagcgt 18 60 

* 

tggcgcttgg ctcaagcctt caatgcgacc ttgaaaccat ctgaaattct tgatcctttt 1920 
acagagcaag agaaaaagaa aggcgtctca gaatatgcca atatgctaaa ggtccatgag 1980 
cggattggct atgtggaaat tcctgcgatt gatcaggaaa ttccgatgta tgtcggaacg 2040 
agtgaggaca ttcttcagaa aggggcaggg ctgttagaag gggcttcgct gcctgttgga 2100 
ggtgaaaata cccatacagt gatcactgct cacagaggat tgccaacggc agaattgttc 2160 
agtcaattgg ataagatgaa aaaaggggat atcttttatc ttcacgtttt agatcaggtg 2220 
ttggcctacc aagtggatca gatagtgacg gtggagccga atgactttga gcctgtcttg 2280 
attcaacatg gggaagatta tgcgaccttg ttgacttgta caccgtatat gattaacagt 2340 
catcgtctgt tggtacgtgg gaagcggatt ccgtatacgg caccaattgc agagcggaat 2400 
cgagcggtga gagagcgtgg gcaattctgg ttgtggttat tactaggagc gatggcggtc 24 60 
atccttctct tgctgtatcg cgtgtatcgt aatcgacgga ttgtcaaagg actagaaaag 2520 

— 

caattggagg ggcgtcatgt caaggactaa actacgagcc ttattgggat acttgttgat 2580 

gttggtagcc tgtttgattc ctatttattg ttttggacag atggtgttgc agtctcttgg 2 64 0 

acaggtgaaa ggtcatgcta catttgtgaa atccatgaca actgaaatgt accaagaaca 2700 
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acagaaccat tctc 2714 



<210> 23 
<211> 297 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 23 

Met Asp Asn Ser Arg Arg Ser Arg Lys Lys Gly Thr Lys Lys Lys Lys 
15 10 15 

His Pro Leu lie Leu Leu Leu lie Phe Leu Val Gly Phe Ala Val Ala 

20 25 30 

lie Tyr Pro Leu Val Ser Arg Tyr Tyr Tyr Arg lie Glu Ser Asn Glu 

35 40 45 

Val lie Lys Glu Phe Asp Glu Thr Val Ser Gin Met Asp Lys Ala Glu^ 
50 55 60 . - 

Leu Glu Glu Arg Trp Arg Leu Ala Gin Ala Phe Asn Ala Thr Leu Lys 
65 70 75 80 

Pro Ser Glu lie Leu Asp Pro Phe Thr Glu Gin Glu Lys Lys Lys Gly 

85 * 90 95 

Val Ser Glu Tyr Ala Asn Met Leu Lys Val His Glu Arg lie Gly Tyr 

100 105 110 

Val Glu lie Pro Ala lie Asp Gin Glu He Pro Met Tyr Val Gly Thr 
115 120 125 

Ser Glu Asp He Leu Gin Lys Gly Ala Gly Leu Leu Glu Gly Ala Ser 
130 135 140 

Leu Pro Val Gly Gly Glu Asn Thr His Thr Val lie Thr Ala His Arg 
145 150 155 160 

Gly Leu Pro Thr Ala Glu Leu Phe Ser Gin Leu Asp Lys Met Lys Lys 

165 170 175 

Gly Asp He Phe Tyr Leu His Val Leu Asp Gin Val Leu Ala Tyr Gin 

180 185 190 

Val Asp Gin lie Val Thr Val Glu Pro Asn Asp Phe Glu Pro Val Leu 
195 200 205 
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lie Gin His Gly Glu Asp Tyr Ala Thr Leu Leu Thr Cys Thr Pro Tyr 
210 215 220 

Met lie Asn Ser His Arg Leu Leu Val Arg Gly Lys Arg He Pro Tyr 
225 230 235 240 

Thr Ala Pro He Ala Glu Arg Asn Arg Ala Val Arg Glu Arg Gly Gin 

245 250 255 

Phe Trp Leu Trp Leu Leu Leu Gly Ala Met Ala Val He Leu Leu Leu 

260 265 270 

Leu Tyr Arg Val Tyr Arg Asn Arg Arg He Val Lys Gly Leu Glu Lys 
275 280 285 

Gin Leu Glu Gly Arg His Val Lys Asp 
290 295 



<210> 24 
<211> 894 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 24 

atggacaaca gtagacgttc acgaaaaaaa ggcacaaaaa agaagaaaca tccgctgatc 60 
cttcttctga ttttcttagt aggattcgcc gttgcgatat atccattggt gtctcgttat 120 

i 

tattatcgta ttgagtcaaa cgaggttatt aaagagtttg atgagacggt ttcccagatg 180 

gataaggcag aacttgagga gcgttggcgc ttggctcaag ccttcaatgc gaccttgaaa 240 

ccatctgaaa ttcttgatcc ttttacagag caagagaaaa agaaaggcgt ctcagaatat 300 

gccaatatgc taaaggtcca tgagcggatt ggctatgtgg aaattcctgc gattgatcag 360 

gaaattccga tgtatgtcgg aacgagtgag gacattcttc agaaaggggc agggctgtta 420 

gaaggggctt ' cgctgcctgt tggaggtgaa aatacccata cagtgatcac tgctcacaga 480 

ggattgccaa cggcagaatt gttcagtcaa ttggataaga tgaaaaaagg ggatatcttt 54 0 

tatcttcacg ttttagatca ggtgttggcc taccaagtgg atcagatagt gacggtggag 600 

ccgaatgact ttgagcctgt cttgattcaa catggggaag attatgcgac cttgttgact 660 
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tgtacaccgt atatgattaa cagtcatcgt ctgttggtac gtgggaagcg gattccgtat 720 
acggcaccaa ttgcagagcg gaatcgagcg gtgagagagc gtgggcaatt ctggttgtgg 780 
ttattactag gagcgatggc ggtcatcctt ctcttgctgt atcgcgtgta tcgtaatcga 840 
cggattgtca aaggactaga aaagcaattg gaggggcgtc atgtcaagga ctaa 894 



<210> 25 
<211> 3010 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 25 

tgttaggaaa agcgataaaa tgatgagttt gaagataaag ggatgctgat aaaaatggta 60 

aaaacaaaaa agcaaaaacg aaataatctc ctattaggag tggtattttt cattggaatg 120 

gcggtaatgg cgtatccgct ggtgtctcgc ttgtattatc gagtggaatc aaatcaadaa 180 

attgctgact ttgataagga aaaagcaacg ttggatgagg ctgacattga tgaacgaatg 240 

aaattggcac aagccttcaa tgactctttg aataatgtag tgagtggcga tccttggtcg 300 

gaagaaatga agaaaaaagg gcgagcagag tatgcacgta tgttagaaat ccatgagcgg 360 

atggggcatg tggaaatccc cgttattgac gtggatttgc cggtttatgc tggtactgct 420 

gaagaggtat tgcagcaagg ggctgggcat ctagagggaa cttctctgcc gatcggaggc 480 

aattcgaccc atgcggtgat tacggcacat acaggtttgc caacagctaa gatgtttacg 540 

gatttgacca aacttaaagt tggggataag ttttatgtgc acaatatcaa ggaagtgatg 600 

gcctatcaag tggatcaagt aaaggtgatt gagccgacga actttgatga tttattgatt 660 

gtaccaggtc atgattatgt gaccttgctg acttgtacgc catacatgat caatacccat 720 

cgtctattgg ttcgggggca tcggataccg tacgtagcag aggttgagga agaatttatt 780 

gcagcaaaca aactcagtca tctctatcgc tacctgtttt atgtggcagt tggtttgatt 8 40 

gtgattcttt tatggattat tcgacgcttg cgcaagaaga 'aaaaacaacc ggaaaaggct 900 

ttgaaggcgc tgaaagcagc aaggaaggaa gtgaaggtgg aggatggaca acagtagacg 960 

ttcacgaaaa aaaggcacaa aaaagaagaa acatccgctg atccttcttc tgattttctt 1020 
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agtaggattc gccgttgcga tatatccatt 
aaacgaggtt attaaagagt ttgatgagac 
ggagcgttgg cgcttggctc aagccttcaa 
tccttttaca gagcaagaga aaaagaaagg 
ccatgagcgg attggctatg tggaaattcc 
cggaacgagt gaggacattc ttcagaaagg 
tgttggaggt gaaaataccc atacagtgat 
attgttcagt caattggata agatgaaaaa 
tcaggtgttg gcctaccaag tggatcagat 
tgtcttgatt caacatgggg aagattatgc 
taacagtcat cgtctgttgg tacgtgggaa 
gcggaatcga gcggtgagag agcgtgggca 
ggcggtcatc cttctcttgc tgtatcgcgt 
agaaaagcaa ttggaggggc gtcatgtcaa 
tgttgatgtt ggtagcctgt ttgattccta 
ctcttggaca ggtgaaaggt catgctacat 
aagaacaaca gaaccattct ctcgcctaca 
tagatccttt tttggcggag ggatatgagg 
cagtctatgg ttacttgtct attccaagtt 
cagattatca tcatttaggg atgggcttgg 
atggtacagg gattcgctca gtgattgctg 
tccgccattt ggatcagcta aaagttggag 
ttgtagaata tcagatgatg gacacagaga 
aatcggttag ctctaaaaat atcatgacct 



ggtgtctcgt tattattatc gtattgagtc 1080 
ggtttcccag atggataagg cagaacttga 1140 
tgcgaccttg aaaccatctg aaattcttga 1200 
cgtctcagaa tatgccaata tgctaaaggt 1260 
tgcgattgat caggaaattc cgatgtatgt 1320 
ggcagggctg ttagaagggg cttcgctgcc 1380 
cactgctcac agaggattgc caacggcaga 1440 
aggggatatc ttttatcttc acgttttaga 1500 
agtgacggtg gagccgaatg actttgagcc 1560 
gaccttgttg acttgtacac cgtatatgat 1620 
gcggattccg tatacggcac caattgcaga 1680 
attctggttg tggttattac taggagcgat 1740 
gtatcgtaat cgacggattg tcaaaggact 1800 
ggactaaact acgagcctta ttgggatact 18 60 
tttattgttt tggacagatg gtgttgcagt 1920 
ttgtgaaatc catgacaact gaaatgtacc 1980 
atcaacgctt ggcttcgcaa aatcgcattg 2040 
tcaattacca agtgtctgac gaccctgatg 2100 
tggaaatcat ggagccggtt tatttgggag 2160 
ctcatgtgga tggtacaccg ctgcctctgg 2220 
ggcaccgtgc agagccaagc catgtctttt 2280 
atgctcttta ttatgataat ggccaggaaa 2340 
ttattttacc gtcggaatgg gaaaaattag 2400 
tgataacctg cgatccgatt cctaccttta 24 60 
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ataaacgctt 


attagtgaat 


tttgaacgag 


tcgctgttta 


tcaaaaatca 


gatccacaaa 


2520 


cagctgcagt 


tgcgagggtt 


gcttttacga 


aagaaggaca 


atctgtatcg 


cgtgttgcaa 


2580 


cctctcaatg 


gttgtaccgt 


gggctagtgg 


tactggcatt 


tctgggaatc 


ctgtttgttt 


2640 


tgtggaagct 


agcacgttta 


ctacgaggga 


aataaaaaga 


aatgaaagga 


aagctaaggc 


2700 


tgttcctttt 


tccggctctt 


tgtcaactgt 


agtgggttga 


aaaaaagcta 


agctcgagaa 


2760 


aggacaaatt 


ttgtcctttc 


ttttttgata 


ttcagagcga 


taaaaatccg 


ttttttgaag 


2820 




U. Vw* X-j M t_A CA U V*^ 


aaaaacattcj 


cgcttgataa 


gtttgatgag 


attattggtc 


2880 


gcttccagtt 


tggcattaga 


atagtgtagt 


tgaagggcgt 


tgataacctt 


ttctttatct 


2940 


ttgaggaagg 


ttttaaagac 


agtctgaaaa 


ataggatgaa 


cctgcttaag 


attgtcctcg 


3000 


ataagttcga 












3010 



<210> 26 
<211> 304 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 26 

Met Leu He Lys Met Val Lys Thr Lys Lys Gin Lys Arg Asn Asn Leu 
15 10 15 

Leu Leu Gly Val Val Phe Phe He Gly Met Ala Val Met Ala Tyr Pro 

20 25 30 

Leu Val Ser Arg Leu Tyr Tyr Arg Val Glu Ser Asn Gin Gin He Ala 

35 40 45 

Asp Phe Asp Lys Glu Lys Ala Thr Leu Asp Glu Ala Asp He Asp Glu 
50 55 60 

Arg Met Lys Leu Ala Gin Ala Phe Asn Asp Ser Leu Asn Asn Val Val 
65 70 75 80 

Ser Gly Asp Pro Trp Ser Glu Glu Met Lys Lys Lys Gly Arg Ala Glu 

85 90 95 

Tyr Ala Arg Met Leu Glu He His Glu Arg Met Gly His Val Glu He 

100 105 HO 
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Pro Val lie Asp 
115 

Val Leu Gin Gin 
130 

Gly Gly Asn Ser 
145 

Thr Ala Lys Met 



Phe Tyr Val His 

180 

Val Lys Val lie 
195 

Gly His Asp Tyr 
210 

Thr His Arg Leu 

225 

Val Glu Glu Glu 



Tyr Leu Phe Tyr 

260 

lie Arg Arg Leu 
275 

Ala Leu Lys Ala 
290 



Val Asp Leu Pro 

120 

Gly Ala- Gly His 
135 

Thr His Ala Val 
150 

Phe Thr Asp Leu 
165 

Asn lie Lys Glu 



Glu Pro Thr Asn 

200 

Val Thr Leu Leu 
215 

Leu Val Arg Gly 
230 

Phe lie Ala Ala 

245 

Val Ala Val Gly 



Arg Lys Lys Lys 

280 

Ala Arg Lys Glu 
295 



Val Tyr Ala Gly 



Leu Glu Gly Thr 

140 

lie Thr Ala His 
155 

Thr Lys Leu Lys 
170 

Val Met Ala Tyr 
185 

Phe Asp Asp Leu 



Thr Cys Thr Pro 

220 

His Arg lie Pro 
235 

Asn Lys Leu Ser 
250 

Leu lie Val lie 
265 

Lys Gin Pro Glu 



Val Lys Val Glu 

300 



Thr Ala Glu Glu 
125 

Ser Leu Pro lie 



Thr Gly Leu Pro 

160 

Val Gly Asp Lys 
175 

Gin Val Asp Gin 
190 

Leu lie Val Pro 
205 

Tyr Met lie Asn' 



Tyr Val Ala Glu 

240 

His Leu Tyr Arg 
255 

Leu Leu Trp lie 
270 

Lys Ala Leu Lys 
285 

Asp Gly Gin Gin 



<210> 27 
<211> 915 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 27 

atgctgataa aaatggtaaa aacaaaaaag caaaaacgaa ataatctcct attaggagtg 60 



35 
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gtatttttca ttggaatggc ggtaatggcg 
gtggaatcaa atcaacaaat tgctgacttt 
gacattgatg aacgaatgaa attggcacaa 
agtggcgatc cttggtcgga agaaatgaag 
ttagaaatcc atgagcggat ggggcatgtg 
gtttatgctg gtactgctga agaggtattg 
tctctgccga tcggaggcaa ttcgacccat 
acagctaaga tgtttacgga tttgaccaaa 
aatatcaagg aagtgatggc ctatcaagtg 
tttgatgatt tattgattgt accaggtcat 
tacatgatca atacccatcg tctattggtt 
gttgaggaag aatttattgc agcaaacaaa 
gtggcagttg gtttgattgt gattctttta 
aaacaaccgg aaaaggcttt gaaggcgctg 
gatggacaac agtag 

<210> 28 
<211> 2199 
<212> DNA 

<213> Enterococcus faecalis 
<400> 28 

actaaaattc gtttacttta tgcatttaaa 
aaatgaggcg aatgttgata acggtcatac 
tgtgacaggt gggaaacgtt tcattaaagt 
ggcgggagct tcctttgtcg tccgtgatca 
cgatgaaaca acgaaagcag caacttgggt 

36 



tatccgctgg tgtctcgctt gtattatcga 120 
gataaggaaa aagcaacgtt ggatgaggct 18 0 
gccttcaatg actctttgaa taatgtagtg 240 
aaaaaagggc gagcagagta tgcacgtatg 300 
gaaatccccg ttattgacgt ggatttgccg 360 
cagcaagggg ctgggcatct agagggaact 420 
gcggtgatta cggcacatac aggtttgcca 480 
cttaaagttg gggataagtt ttatgtgcac 54 0 
gatcaagtaa aggtgattga gccgacgaac 600 
gattatgtga ccttgctgac ttgtacgcea 660 
cgggggcatc ggataccgta cgtagcagag 720 
ctcagtcatc tctatcgcta cctgttttat 780 
tggattattc gacgcttgcg caagaagaaa 840 
aaagcagcaa ggaaggaagt gaaggtggag 900 

915 



tgaaaaagca gatcctacga aaggctttaa 60 
cgacgaccaa acaccaccaa ctgttgaagt 1Z0 
cgatggcgat gtgacagcga cacaagcctt 180 
aaacagcgac acagcaaatt atttgaaaat 240 
gaaaacaaaa gctgaagcaa ctacttttac 300 
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aacaacggct gatggattag ttgatatcac 
agaaactgta gctcctgatg attatgtctt 
tgaacaatca tatggcacaa cagaaaacct 
caaaggtacc ttaccttcaa caggtggcaa 
agtcttgcta cttattgcag gagtctactt 
tctagcatca ccgaagaaat ttttagaaaa 
gctctcatgc tttattttta aggaggaagc 
tgatggtttt atgattcttt tactgattat 
tagcgatgca ttaaataact atctggatca 
aagccaagaa aacaccaaag aaatggctga 
agaattagcg aaaaaaggca gcaatcctgg 
aacgaaaaaa ccagacaaat cctattttga 
aaaaataaat gtccgtttac caatttttga 
aagctccttg ttagaaggaa cctcctatcc 
ttcaggccat cgtggtctcc ctcaagccaa 
aggcgatgaa ttttatatcg aagtcaatgg 
aaaaaccgtt gaaccaactg atacaaaaga 
cactttatta acttgcacac cgtatatgat 
tcgtatccca tatcaaccag aaaaagcagc 
aaatttacta ttatggacat tacttttaat 
tatctggtac aagcgacgga aaaagacgac 
taaacatact aaaaaaaaga gtaaaaaaat 
gcataattga accagagaaa cagaagtatt 
taaaaagcga caaagggcca atcaatcgac 
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agggcttaaa tacggtacct attatttaga 360 
gttaacaaat cggattgaat ttgtggtcaa 420 
agtttcacca gaaaaagtac caaacaaaca 480 
aggaatctac gtttacttag gaagtggcgc 540 
tgctagacgt agaaaagaaa atgcttaatt 600 
acaaagagcc tgggccaatc actgtcccag 660 
aatgaagtca aaaaagaaac gtcgtatcat 720 
tggaataggt gcatttgcgt atccttttgt 780 
acaaattatc gctcattatc aagcaaaagc 840 
acttcaagaa aaaatggaaa agaaaaacca 900 
attagatcct ttttctgaaa „ cgcaaaaaac 960 
aagtcatacg attggtgttt taaccattcc 1020 
taaaacgaat gcattgctat tggaaaaagg 1080 
tacaggtggt acgaatacac atgcggtcat 1140 
attatttaca gatttgccag aattaaaaaa 1200 
gaagacgctt gcttatcaag tagatcaaat 1260 
tttacacatt gagtctggcc aagatctcgt 1320 
aaacagtcat cggttattag ttcgaggaca 1380 
agcggggatg aaaaaagtgg cacaacaaca 14 40 
tgcctgtgcg ttaattatta gcggcttcat 1500 
cagaaaacca aagtagtatg acgaaaaggc 1560 
agcttttcaa tttttaatcc tccttatcgt 1620 
aacgaaataa ctaaaagagc aagccctgaa 1680 
tgtttaaatt cctgccaagt ttggattttt 1740 
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ctgttttttt tcgcgctatc ctcaagcgtg agtaaataat tcaatagtaa gaggagtagc 1800 
aacaccgtga aatcatttgt ggtaaaaagc acatgtaaaa atagaatgac aaagacaaca 18 60 
cgggataaca ctcgattccg caaaattaaa aataacttag cacgcataat aaaccaccat 1920 ■ 
ttcttatcag agataatgaa tctgtttttg tctactcttt agttatatca taaaattctt 1980 
aataatgaaa aaatgactcg agaaaataat tgaaaaaagt tttttttcct gaatcattat 2040 
tttcgtaaat aaagaataaa cgtgttactc ttggcttatc aaatttggaa ggagtgttaa 2100 
aaatgaaata tctggatatt attgctttaa ttttattgat tgtcggaggt ttaaactggt 2160 
tattagttgg tgcatttaat tttgatttag ttgcaacaa 2199 



<210> 29 
<211> 284 
<212> PRT 

<213> Enterococcus faecalis 
<400> 29 

Met Lys Ser Lys Lys Lys Arg Arg He He Asp Gly Phe Met He Leu 
15 10 15 

Leu Leu He He Gly He Gly Ala Phe Ala Tyr Pro Phe Val Ser Asp 

20 25 30 

Ala Leu Asn Asn Tyr Leu Asp Gin Gin He He Ala His Tyr Gin Ala 

35 40 45 

Lys Ala Ser Gin Glu Asn Thr Lys Glu Met Ala Glu Leu Gin Glu Lys 
50 55 60 

Met Glu Lys Lys Asn Gin Glu Leu Ala Lys Lys Gly Ser Asn Pro Gly 
65 70 75 80 

Leu Asp Pro Phe Ser Glu Thr Gin Lys Thr Thr Lys Lys Pro Asp Lys 

85 90 95 

Ser Tyr Phe Glu Ser His Thr He Gly Val Leu Thr He Pro Lys He 

100 105 HO 

Asn Val Arg Leu Pro He Phe Asp Lys Thr Asn Ala Leu Leu Leu Glu 
115 120 . 125 

Lys Gly Ser Ser Leu Leu Glu Gly Thr Ser Tyr Pro Thr Gly Gly Thr 

38 
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130 135 140 

Asn Thr His Ala Val lie Ser Gly His Arg Gly Leu Pro Gin Ala Lys 
145 150 155 160 

Leu Phe Thr Asp Leu Pro Glu Leu Lys Lys Gly Asp Glu Phe Tyr lie 

165 170 175 

Glu Val Asn Gly Lys Thr Leu Ala Tyr Gin Val Asp Gin lie Lys Thr 

180 185 190 

Val Glu Pro Thr Asp Thr Lys Asp Leu His lie Glu Ser Gly Gin Asp 
195 200 205 

Leu Val Thr Leu Leu Thr Cys Thr Pro Tyr Met lie Asn Ser His Arg 
210 215 220 

Leu Leu Val Arg Gly His Arg lie Pro Tyr Gin Pro Glu Lys Ala Ala 

225 230 235 240 

Ala Gly Met Lys Lys Val Ala Gin Gin Gin Asn Leu Leu Leu Trp Thr 

245 250 255 

Leu Leu Leu lie Ala Cys Ala Leu lie lie Ser Gly Phe lie lie Trp 

260 265 270 

Tyr Lys Arg Arg Lys Lys Thr Thr Arg Lys Pro Lys 

275 280 



<210> 30 
<211> 855 
<212> DNA 

<213> Enterococcus faecalis 
<400> 30 

atgaagtcaa aaaagaaacg tcgtatcatt gatggtttta tgattctttt actgattatt 60 
ggaataggtg catttgcgta tccttttgtt agcgatgcat taaataacta tctggatcaa 120 
caaattatcg ctcattatca agcaaaagca agccaagaaa acaccaaaga aatggctgaa 180 
cttcaagaaa aaatggaaaa gaaaaaccaa gaattagcga aaaaaggcag caatcctgga 240 
ttagatcctt tttctgaaac gcaaaaaaca acgaaaaaac cagacaaatc ctattttgaa 300 
agtcatacga ttggtgtttt aaccattcca aaaataaatg tccgtttacc aatttttgat 360 
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aaaacgaatg cattgctatt ggaaaaagga agctccttgt tagaaggaac ctcctatcct 420 

acaggtggta cgaatacaca tgcggtcatt tcaggccatc gtggtctccc tcaagccaaa 4 80 

ttatttacag atttgccaga attaaaaaaa ggcgatgaat tttatatcga agtcaatggg 540 

aagacgcttg cttatcaagt agatcaaata aaaaccgttg aaccaactga tacaaaagat 600 

ttacacattg agtctggcca agatctcgtc actttattaa cttgcacacc gtatatgata 660 

aacagtcatc ggttattagt tcgaggacat cgtatcccat atcaaccaga aaaagcagca 720 

gcggggatga aaaaagtggc acaacaacaa aatttactat tatggacatt acttttaatt 780 

gcctgtgcgt taattattag cggcttcatt atctggtaca agcgacggaa aaagacgacc 840 

agaaaaccaa agtag ^55 



<210> 31 
<211> 2687 
<212> DNA 

<213> Corynebacterium diphtheriae 
<400> 31 

gtggtccgga gtatgacaag aacgctccgg ttcaggtaaa cggcactggt aacggtaacg 60 
atctcgtggt cacctctgac aagaacggca acgtccactt cgagggcctg ttcgtctccg 120 
acgaccagaa tgatccggga aagtcagctg cgcagcgctg ctacgtcctc gtcgagaccg 18 0 
aggccccgac gggcttcgtt actccgaaag atgggacggt cttcccagtt gctgtaaaga 240 
ttggacagac tgctaccact acctacgacg caaaggtcga gaacgtcaag cgcgataccc 300 
ctgacctgcc gctgaccggt ggcaagggtg tgctgttcct gatgattgcc ggtggtctgt 360 
tgctgctggt tgctgttggt gctggtttcg tctttgtacg ccgtatcaac gagtaattga 420 
tttgtcgcgt gattaaataa tcgcgttgcg ccgcccaatg cagggcatca aatgccccgc 480 
cggcgggcat aaacgccggc ggggtgcggt ggctttccac cgcaccccca cattctttgt 54 0 
cagagatttg ctgtttggcc tgtgccaccc ggcatccccc tatatgagaa acggacgtac 600 
ctgtcatggt taccaccgcg tcaccgcgct ctaccggacc ggataaccca gacgcgcaac 660 
caaagcgtcg ttgggtcttt tccggactcg cattgtttgc gtgtataacg gcgctagccg 720 

40 
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gcctcatgtt ggggttgtat ccatctactg cagcgtggtt taacgcccgc gaacaggcca 780 
aactggtaga tctctatgat tccaaaattg aaaatgcaac ccctcttagc gcggaacaat 840 
tacttgaact cgcgcaccgt tataacgacc gcctgaccgt aggcgctgct ctcgatccct 900 
gggctaacgt cccccgcgga gcgggcaaag aagacggcga cggtatggcc tataaagacc 960 
agttgcgtgt tgaccgtacg gatgtcatgg ctcgtatacg tatcccctct atcaaggtgg 1020 
atctaccgat ctatcacggc acgagcgata acactctaaa gaagggcgct ggccatttgg 1080 
aaggtacctc gttaccggtg ggaggaccac gcacccattc cgttatcact gcacaccgtg 1140 
gcttagctga ggccaccatg ttcactaatc tcaacaaggt tggggtaggg gatagattca 1200 
ccattgaggt gatgggcgaa gtccttacgt atgaagtgcg tgaaactcgt gtggtcagcc 1260 
cagaggacac taggttcctg caaactcaag acgatcgtga ccttgtcaca ctcgttactt 1320 
gtactccgtt gggcatcaat acacatcgca ttctggtgac agctgagcgc attactccca 1380 
ccccgcaatc cgatatcgat gcagcacgtc aagcttccca aatcggcttc ccttggtggg 1440 
cggtcatttt cgcagtggga tttagcttta tcgccttgtt cttctggcgt tcgggttaca 1500 
tgattcctcc aaagaagaag gaagaagaca tcgaaagcga agctgatggc gatgaactct 15 60 
gaaacggcgg ggaaggaacc caacgtggtc agtaccgacg ctaaacactc caccggtacc 1620 
agttccaatg cgggtaccgg tgagagctca gcgaaaaaga aagcgcagac ggcaattgct 1680 
gcgatagtca tgcttttgtg cggactgtta gggctggtga ttctgttcta tccagtcgtg 1740 
tccactcaac ttaacaatta tgaacagtct aaactcgccc gacagtttgg tgcagacgct 1800 
gcccaagctg accctgccgt agttgctgct gctcttgatg ctgcccatgc ctacaacgat 18 60 
tcgctagaaa atggacccct gcaggatccg tggaccggtg gagatagcac taaggatcct 1920 
gcctatcagg catacgagaa actcttaggg gaatatccgg cgatggctca gatctctatc 1980 
ccggctattt ccgtgaacct tcccatttac cacgggacaa gcgacgccac actcctcaaa 204 0 
ggtgttgggc acctttacgg tactgcgcta cccgttggtg gactggggac gcgttcggtt 2100 
ctaacagcgc attcaggtat ccaaaaatcg accttctttg acaatttaga aaaggtcaaa 2160 

41 
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aagggtgacg ccatttatgt acgcaatatt ggtgagaccc tgaaatacca agtacgcgac 2220 
atcgaaatca tccgtccagc ggagattgac cgtatccagc caatcccaga ccgagactta 2280 
attaccctcg tgacctgtac accctatgga atcaataccc ataggctttt ggttactgcc 2340 
gaacgtgtcc ctatggaacc cggtgaggcg gaccgtgcat ttgccggtga cggaattgtc 24 00 
tggcagtggt ggatgaagct agctatcggt gtgttggtgg tcatccttct cctaactggg 24 60 
tggctcatta tccgtatttt gcgagctagg aaattcgcga agaaaacagc tggagcagac 2520 
gctgctaaat ctgttgaacc tggtgatatt gaggcgtcgc taagcgcttc agcggccgag 2580 
gagtcccagt aatatgcaga aaccaatttc cccaacacat gcaaacaccc aagcagtcgc 2640 
ccattcctga aaggacgccc tactatgaag aagactcact tgttccg 2687 



<210> 32 
<211> 348 
<212> PRT 

<213> Corynebacterium diphtheriae 
<400> 32 

Met Ala Met Asn Ser Glu Thr Ala Gly Lys Glu Pro Asn Val Val Ser 

15 10 15 

Thr Asp Ala Lys His Ser Thr Gly Thr Ser Ser Asn Ala Gly Thr Gly 

20 25 30 

Glu Ser Ser Ala Lys Lys Lys Ala Gin Thr Ala lie Ala Ala lie Val 

35 40 45 

Met Leu Leu Cys Gly Leu Leu Gly Leu Val lie Leu Phe Tyr Pro Val 
50 55 60 

Val Ser Thr Gin Leu Asn Asn Tyr Glu Gin Ser Lys Leu Ala Arg Gin 
65 70 75 80 

Phe Gly Ala Asp Ala Ala Gin Ala Asp Pro Ala Val Val Ala Ala Ala 

85 90 95 

Leu Asp Ala Ala His Ala Tyr Asn Asp Ser Leu Glu Asn Gly Pro Leu 

100 105 HO 

Gin Asp Pro Trp Thr Gly Gly Asp Ser Thr Lys Asp Pro Ala Tyr Gin 
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115 

Ala Tyr Glu Lys 
130 

lie Pro Ala lie 
145 

Ala Thr Leu Leu 



Val Gly Gly Leu 

180 

Gin Lys Ser Thr 
195 

Ala lie Tyr Val 
210 

Asp lie Glu lie 

225 

Pro Asp Arg Asp 



Asn Thr His Arg 

260 

Gly Glu Ala Asp 
275 

Trp Met Lys Leu 
290 

Gly Trp Leu lie 
305 

Thr Ala Gly Ala 



Ala Ser Leu Ser 

340 



<210> 33 
<21i> 1047 
<212> DNA 



120 

Leu Leu Gly Glu 
135 

Ser Val Asn Leu 
150 

Lys Gly Val Gly 
165 

Gly Thr Arg Ser 



Phe Phe Asp Asn 

200 

Arg Asn lie Gly 
215 

lie Arg Pro Ala 

230 

Leu lie Thr Leu 
245 

Leu Leu Val Thr 



Arg Ala Phe Ala 

280 

Ala lie Gly Val 

295 

lie Arg lie Leu 
310 

Asp Ala Ala Lys 
325 

Ala Ser Ala Ala 



Tyr Pro Ala Met 

140 

Pro lie Tyr His 
155 

His Leu Tyr Gly 
170 

Val Leu Thr Ala 
18 5 

Leu Glu Lys Val 



Glu Thr Leu Lys 

220 

Glu lie Asp Arg 
235 

Val Thr Cys Thr 
250 

Ala Glu Arg Val 
265 

Gly Asp Gly lie 



Leu Val Val -lie 

300 

Arg Ala Arg Lys 
315 

Ser Val Glu Pro 
330 

Glu Glu Ser Gin 
345 



125 

Ala Gin He Ser 



Gly Thr Ser Asp 

160 

Thr Ala Leu Pro 
175 

His Ser Gly He 
190 

Lys Lys Gly Asp 
205 

Tyr Gin Val Arg 

He Gin Pro He 

240 

Pro Tyr Gly He 
255 

Pro Met Glu Pro 
270 

Val Trp Gin Trp 
285 

Leu Leu Leu Thr 



Phe Ala Lys Lys 

320 

Gly Asp He Glu 
335 
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<213> Corynebacterium diphtheriae 
<400> 33 

atggcgatga actctgaaac ggcggggaag gaacccaacg tggtcagtac cgacgctaaa 60 
cactccaccg gtaccagttc caatgcgggt accggtgaga gctcagcgaa aaagaaagcg 120 
cagacggcaa ttgctgcgat agtcatgctt ttgtgcggac tgttagggct ggtgattctg 18 0 
ttctatccag tcgtgtccac tcaacttaac aattatgaac agtctaaact cgcccgacag 240 
tttggtgcag acgctgccca agctgaccct gccgtagttg ctgctgctct tgatgctgcc 300 
catgcctaca acgattcgct agaaaatgga cccctgcagg atccgtggac cggtggagat 360 
agcactaagg atcctgccta tcaggcatac gagaaactct taggggaata tccggcgatg 420 
gctcagatct ctatcccggc tatttccgtg aaccttccca tttaccacgg gacaagcgac 480 
gccacactcc tcaaaggtgt tgggcacctt tacggtactg cgctacccgt tggtggact'g 540 
gggacgcgtt cggttctaac agcgcattca ggtatccaaa aatcgacctt ctttgacaat 600 
ttagaaaagg tcaaaaaggg tgacgccatt tatgtacgca atattggtga gaccctgaaa 660 
taccaagtac gcgacatcga aatcatccgt ccagcggaga ttgaccgtat ccagccaatc 720 
ccagaccgag acttaattac cctcgtgacc tgtacaccct atggaatcaa tacccatagg 780 
cttttggtta ctgccgaacg tgtccctatg gaacccggtg aggcggaccg tgcatttgcc 840 
ggtgacggaa ttgtctggca gtggtggatg aagctagcta tcggtgtgtt ggtggtcatc 900 
cttctcctaa ctgggtggct cattatccgt attttgcgag ctaggaaatt cgcgaagaaa 960 
acagctggag cagacgctgc taaatctgtt gaacctggtg atattgaggc gtcgctaagc 1020 
gcttcagcgg ccgaggagtc ccagtaa 1047 

<210> 34 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Consensus/Streptococcus pyogenes 

44 
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<220> 

<223> X in position 12 can either be a S/T. 
<220> 

<223> X in position 18 can either be a R/K. 
<400> 34 

Thr Leu Leu Thr Cys Thr Pro Tyr Met lie Asn Xaa His Arg Leu Leu 
15 10 15 

Val Xaa Gly 



<210> 35 
<211> 19 
<212> PRT 

<213> Corynebacterium diphtheriae 
<400> 35 

Thr Leu Val Thr Cys Thr Pro Tyr Gly He Asn Thr His Arg Leu Leu 
15 10 15 

Val Thr Ala 



<210> 36 
<211> 19 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 36 

Thr Leu Val Thr Cys Thr Pro Tyr Gly Val Asn Thr Lys Arg Leu Leu 
1 5 10 15 

Val Arg Gly 



<210> 37 
<211> 150 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 37 

He Glu Asn Asn Asp lie Met Gly Tyr Val Glu Val Pro Ser He Lys 
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1 

Val Thr Leu Pro 

20 

Gly Ala Gly His 

35 

Thr His Thr Val 
50 

Phe Thr Asn Leu 
65 

Val Leu Asn Lys 



Glu Pro Asp Gin 

100 

Ala Thr Leu Val 
115 

Leu Val Arg Gly 
130 

Ala Lys Lys Ala 
145 



5 

lie Tyr His Tyr 



Leu Phe Gly Ser 

40 

lie Ser Ala His 

55 

Asn Leu Val Lys 
70 

Val Leu Ala Tyr 
85 

Val Thr Ser Leu 



Thr Cys Thr Pro 

120 

His Arg lie Ala 
135 

Met Lys 
150 



10 

Thr Thr Asp Glu 
25 

Ala Leu Pro Val 



Arg Gly Leu Pro 

60 

Lys Gly Asp Thr 

75 

Lys Val Asp Gin 
90 

Ser Gly Val Met 
105 

Tyr Gly Val Asn 



Tyr His Tyr Lys 

140 



15 

Val Leu Thr Lys 
30 

Gly Gly Asp Gly 
45 

Ser Ala Glu Met 



Phe Tyr Phe Arg 

80 

lie Leu Thr Val 

95 

Gly Lys Asp Tyr 
110 

Thr Lys Arg Leu 

125 

Lys Tyr Gin Gin 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1, 2, 25, 26 and patially 11-24, 35-51 

Streptococcal polypeptide EmaA comprising SEQ ID N0:2, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID N0:1 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 



2. Claims: 3, 4, 27, 28 and partially 11-24, 35-51 

Streptococcal polypeptide EmaB comprising SEQ ID N0:4, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID NO: 3 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 



3. Claims: 5, 6, 29, 30 and partially 11-24, 35-51 

Streptococcal polypeptide EmaC comprising SEQ ID NO: 6, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID N0:5 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 



4. Claims: 7, 8, 31, 32 and partially 11-24, 35-51 

Streptococcal polypeptide EmaD comprising SEQ ID NO: 8, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID N0:7 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 
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5. Claims: 9, 10, 33, 34 and partially 11-24, 35-51 

Streptococcal polypeptide EmaE comprising SEQ ID NO: 10, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID N0:9 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 



6. Claims: 52-54 

Streptococcal Ema polypeptide comprising SEQ ID N0:23 and 
nucleic acid encoding it 



7. Claims: 55-57 

Streptococcal Ema polypeptide comprising SEQ ID NO: 26 and 
nucleic acid encoding it. 



8. Claims: 58, 59 

Streptococcal Ema polypeptide comprising SEQ ID NO: 37 and 
nucleic acid encoding it. 



9. Claims: 60-62 

Enterococcal Ema polypeptide compriding SEQ ID NO: 29 and 
nucleic acid encoding it. 



10. Claims: 63-65 

Corynebacteri urn Ema polypeptide and nucleic acid encoding it. 



11. Claims: 66, 67 and partially 71 

Polypeptide comprising SEQ ID N0:34 



12. Claim : 68 and partially 71 

Polypeptide comprising SEQ ID N0:35 



13. Claims: 69, 70 and partially 71 

Polypeptide comprising SEQ ID NO: 36 
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